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Chapter 1 

Goldstone Bosons 


Goldstone bosons are weakly-coupled states which appear in the low-energy spectrum of 
any system for which a rigid (or global) symmetry is spontaneously broken (that is, the 
symmetry is not preserved by the system’s ground state). A great deal is known about 
the properties of these bosons, since at low energies their properties are largely governed 
by the nature of the symmetries which are spontaneously broken, depending only weakly 
on the details of the system itself. 

This review is devoted to explaining the modern effective lagmngian method for 
identifying Goldstone boson properties. These methods are much more efficient than are 
the older current-algebra techniques of yore. 

1.1 Introduction 

It is a common feature of many physical systems that their behaviour is relatively simple 
when examined only at low energies (or temperatures) compared to the system’s own 
characteristic scales. It often happens that there are relatively few states which can 
participate in low-energy processes, and their interactions can sometimes become less and 
less important the lower the energies that are examined. Very general theoretical tools 
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exist to exploit this simplicity, when it arises. 

One such tool is the technique of effective lagrangians. The guiding idea for this 
method is the belief, hrst clearly enunciated by Weinberg, that there is no loss of generality 
in using a held theory to capture the low-energy behaviour of any system. This is because 
held theory in itself contains very little content beyond ensuring the validity of general 
‘motherhood’ properties like unitarity, cluster decomposition, and so on. According to 
this point of view, if a held theory is identihed which is the most general consistent with 
the low-energy degrees of freedom and symmetries of any particular system (together with 
a few ‘motherhood’ properties) then this held theory must provide a good description of 
the system’s low-energy limit. 

This is a particularly useful observation when the low-energy degrees of freedom 
are weakly interacting [regardless of how strongly interacting their higher-energy coun¬ 
terparts might be), because then the resulting held theory may be simple enough to be 
used to predict explicitly the system’s low-energy properties. This simplicity is some¬ 
what paradoxical since, as we shall see, the low-energy ehective lagrangians are typically 
very complicated, involving all possible powers of the various helds and their deriva¬ 
tives. Simplicity is achieved in spite of the complicated ehective lagrangian because, for 
weakly-coupled theories, general power-counting arguments exist which permit an efficient 
identihcation of the comparatively few interactions which appear at any given order in a 
low-energy expansion. 

Remarkably, there turns out to be a very important situation for which very gen¬ 
eral results are known concerning the existence of very light degrees of freedom whose 
low-energy interactions are weak. This occurs whenever a continuous global symmetry is 
spontaneously broken [i.e. which is a symmetry of the hamiltonian but not a symmetry 
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of the ground state), since when this happens Goldstone’s theorem guarantees the ex¬ 
istence of low-energy Goldstone bosons, as well as determining a great deal about their 
interactions. These theorems, and their description in terms of an effective lagrangian 
formulation, are the subject of this review. 

1.1.1 A Road Map 

This section outlines how the material covered in this review is organized. 

1. General Formalism: All of the general results may be found in Ghapter 1, starting 
with a statement of the key theorems — those of Noether and Goldstone — which 
underlie everything else. This is followed by a motivational discussion of the sim¬ 
plest example to which Goldstone’s theorem applies. Although the properties of the 
Goldstone bosons are guaranteed by general theorems, the moral of the example is 
that these properties are generally not manifest in a low-energy effective lagrangian 
unless a special choice of variables is used. These variables are identihed and ex¬ 
ploited first for spontaneously-broken abelian internal symmetries, and then the 
process is repeated for nonabelian internal symmetries. Both lorentz-invariant and 
nonrelativistic systems are considered. For the nonrelativistic case, special attention 
given to the breaking of time reversal, since this qualitatively affects the nature of 
the low-energy effective lagrangian. The spontaneous breaking of spacetime sym¬ 
metries, like rotations, translations and lorentz transformations, is not discussed in 
this review. 

2. Applications: Ghapters 2 throngh 4 are devoted to specihc applications of the 
methods of Ghapter 1 to examples in high-energy/nnclear and condensed-matter 
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physics. Chapter 2 starts with the classic relativistic example of pions as pseudo- 
Goldstone bosons, whose study gave birth to many of the techniques described in 
Chapter 1. (A pseudo-Goldstone boson is the Goldstone boson for an approximate 
symmetry, as opposed to an exact symmetry.) This is followed in Chapter 3 by a 
study of spin waves (magnons) in both ferromagnets and antiferromagnets. Chapter 
4 then closes with a recent, more speculative, application of these ideas to the S'0(5) 
proposal for the high-temperature superconductors. 

3. Bibliography: Finally, Chapter 5 contains a brief bibiography. It is not meant 
to be exhaustive, as a great many articles have emerged over the past decades of 
applications of these methods. I therefore restrict myself to listing those papers and 
reviews of which I am most familiar. I apologize in advance to the authors of the 
many excellent articles I have omitted. 

The review is aimed at upper-year graduate students, or practicing researchers, since 
it presupposes a familiarity with quantum held theory. It was written with an audience 
of high-energy and nuclear phycisists in mind, and so for the most part units are used for 
which h = c = 1. However, I hope it will prove useful to condensed-matter physicists as 
well. Enjoy! 

1.2 Noether’s Theorem 

We start with a statement of Noether’s theorem, since this plays a role in the statement 
of Goldstone’s theorem, which is the main topic of this chapter. 

For a held theory Noether’s theorem guarantees the existence of a conserved current, 
for every global continuous symmetry of the action. To low orders in the derivative 
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expansion it is nsnally enongh to work with actions which depend only on the fields and 
their first derivatives, so we restrict onr statement of the theorem to this case. 

Consider therefore a system governed by an action S = J d'^x C{cj), 9^0), where 0(x) 
generically denotes the fields relevant to the problem. We imagine that S is invariant 
nnder a set of transformations of these fields, S(j) = ^a{4>) where a;“ denote a collection 
of independent, spatially constant symmetry parameters. Invariance of S implies that the 
lagrangian density, £, must vary at most into a total derivative: 


sc = a, (^“vr), 


( 1 . 2 . 1 ) 


for some quantities This equation is meant to hold as an identity, for arbitrary 

field configurations, 0, and for arbitrary constant parameters, uj°‘. Rewriting the variation 
of C directly in terms of the variations of the fields, and equating to zero the coefficient 
of the arbitrary constant a;“ in the result then gives: 


d£ 

= yrrCa 


dC 


d(j) d{d^(j)) 




TT-- 0,1 


dC 


+ dn 


dC 




( 1 . 2 . 2 ) 


The statement of the theorem follows from this last equation. It states that the 


quantities 


satisfy the special property 


7^ = 
J a 


dC 




+ KT, 


(1.2.3) 


= 0, 


(1.2.4) 


when they are evaluated at a solution to the equations of motion for S — i.e. on field 
configurations for which the square bracket on the right-hand-side of eq. ([1.2.2D vanishes. 
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Even though we have used relativistic notation in this argument, the conclusion, 


eq. (|1.2.4|) , is equally valid for nonrelativistic systems. For these systems, if we write 
pa = ia the temporal component of j^, and denote its spatial components by the three- 
vector j(j, then current conservation (eq. ( p..2.4|) ) is equivalent to the familiar continuity 
equation 

' V-ja = 0. (1.2.5) 


dt 

Eq. (|1.2.4|) or eq. (|1.2.5|) , are called conservation laws because they guarantee that 
the charges, Qa, dehned by 


Qa{t) = f dh Pair, t) = f dh faix), ( 1 . 2 . 6 ) 

J fixed t J 

are conserved in the sense that they are independent of t. These charges are sometimes 
called the generators of the symmetry because their commutator with the helds give the 
symmetry transformations themselves 




(1.2.7) 


The existence of such a conserved current carries special information if the symmetry 
involved should be spontaneously broken, as we now describe. 

1.3 Goldstone’s Theorem 


Whenever the ground state of a system does not respect one of the system’s global con¬ 
tinuous symmetries, there are very general implications for the low-energy theory. This 
is the content of Goldstone’s theorem, which we now present. This theorem is central 
to the purpose of this chapter, which is devoted to making its implications manifest in a 
low-energy effective theory. 
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Goldstone’s theorem states that any system for which a continuous, global symmetry 
is spontaneously broken, must contain in its spectrum a state, |G) — called a Goldstone 
mode, or Goldstone boson since it must be a boson|^ — which has the dehning property 
that it is created from the ground state by performing a spacetime-dependent symmetry 
transformation. In equations, |G) is dehned by the condition that the following matrix 
element cannot vanish;^ 

{G\p{r,tm^0. (1.3.1) 

Here, |fl) represents the ground state of the system, and p = is the density for the 
conserved charge — guaranteed to exist by Noether’s theorem — for the spontaneously 
broken symmetry. 

Before turning to its implications, we outline the proof of this result. The starting 
point is the assumption of the existence of a local order parameter. This can be dehned 
to be a held, in the problem which satishes two dehning conditions. Firstly, it 

transforms nontrivially under the symmetry in question: i.e. there is another held, 'ip{x), 
for which: 

di/j = i[Q,'ijj{x)] = (j){x). (1.3.2) 

Q is the conserved charge dehned by integrating the density p(r, t) throughout all of space. 
Secondly, the held 0 must have a nonzero expectation in the ground state: 


(0) = (fl|0(a;)|fl) = V jtz 0, 


(1.3.3) 


This last condition would be inconsistent with eq. ( [1.3.2|) if the ground state were invariant 


^Supersymmetry is an exception to this statement, since spontaneously broken global supersymmetry 
ensures the existence of a Goldstone fermion, the goldstino. 

^We use nonrelativistic notation here to emphasize that the conclusions are not specific to relativistic 
systems. This will prove useful when nonrelativistic applications are considered in later sections. 




under the symmetry of interest, since this would mean Q|f2) = 0, implying the right-hand- 


side of eq. ( p..3.3|) must vanish. 

The proof of the theorem now proceeds from the following steps, (z) Substitute 
eq. (|1.3.2|) into eq. (|1.3.3| ); (u) Use Q = J p cPr in the result, as is guaranteed to be 
possible by Noether’s theorem; (Hi) Insert a partition of unity as a sum over energy 
eigenstates, 1 = either side of the operator p. The resulting expression 

shows that if no energy eigenstate exists which satishes the dehning condition, eq. ( |1.3.1| ), 
then the right-hand-side of eq. (11.3.31) must vanish, in contradiction with the starting 
assumptions. This proves the theorem. We next elaborate on its consequences. 


The dehning matrix element, eq. (|1.3.1| ), and the conservation law, eq. ( p..2.5|) , to¬ 
gether imply that Goldstone bosons must have a number of important properties. Besides 
determining their spin and their statistics, it implies two properties which are of particular 
importance: 

1. The Goldstone boson must be gapless, in that its energy must vanish in the limit 
that its (three-) momentum vanishes. That is: 


lim E{p) = 0. 

p^O 


(1.3.4) 

To see why this follows from eq. ( 1.3.1| ), it is helpful to make the dependence 
on position and time in this equation explicit by using the identities pa{r,t) = 
Pa(i', 0) and ja(r,t) = ja(0, t) together with the energy- and 

momentum-eigenstate conditions: H\fl) = P|G) = 0, P|G(p)) = p|G(p)) and 


H\G{p)) = Ep\G{p)). Then, differentiation of eq. (|1.3.1|) with respect to t, and 
use of the continuity equation, eq. (|1.2.5|) , gives: 


-zEpe-^^^^ {G\pa{r,0m = (G|^(r,t)|G) 


^dpa 

dt 
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= - {G\V-Ur,t)\n) (1.3.5) 

= -^P • (G'|ja(r,t)|ll). 


Eq. (|1.3.4|) follows from this last equality in the limit p —»■ 0, given that the matrix 
element, (G|pa(r, 0)|f2), cannot vanish by definition for a Goldstone boson. 

In relativistic systems, for which E{p) = y/p'^ + m'^ where m is the particle mass, 
the gapless condition, eq. (|1.3.4|) , is equivalent to the masslessness of the Goldstone 
particle. 


2. More generally, the argument just made can be extended to more complicated matrix 
elements. One hnds in this way that the Goldstone boson for any exact symmetry 
must completely decouple from all of its interactions in the limit that its momentum 


vanishes. Physically, this is because eq. (|1.3.1|) states that in the zero-momentum 
limit the Goldstone state literally is a symmetry transformation of the ground state. 
As a result it is completely indistinguishable from the vacuum in this limit. 


These properties have a lot of implications for the low-energy behaviour of any 
system which satishes the assumptions of the theorem. The hrst guarantees that the 
Goldstone boson must itself be one of the light states of the theory, and so it must be 
included in any effective lagrangian analysis of this low energy behaviour. The second 
property ensures that the Goldstone mode must be weakly coupled in the low-energy 
limit, and strongly limits the possible form its interactions can take. 

The properties of gaplessness and low-energy decoupling can also be useful even if 
the spontaneously broken ‘symmetry’ in question is really not an exact symmetry. To the 
extent that the symmetry-breaking terms, ifsb, of the system’s Hamiltonian are small, the 
symmetry may be regarded as being approximate. In this case the violation of the gapless 
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and decoupling properties can usefully be treated perturbatively in i^gb- The Goldstone 
particles for any such approximate symmetry — called pseudo-Goldstone bosons — are 
then systematically light and weakly coupled at low energies, instead of being strictly 
massless or exactly decoupled. 

The purpose of the remainder of this chapter is to show in detail how these properties 
are encoded into the low-energy effective lagrangian. By considering simple examples we 
hnd that although these properties are always true, they need not be manifest in the 
lagrangian in an arbitrary theory. They can be made manifest, however, by performing 
an appropriate held redehnition to a standard set of held variables. We hrst identify 
these variables, and use them to extract the implications of Goldstone’s theorem for the 
low-energy ehective theory in the simplest case, for which the symmetry group of interest 
is abelian. The results are then generalized in subsequent sections to the nonabelian case. 

1.4 Abelian Internal Symmetries 

In order to see the issues which are involved, it is instructive to consider a simple held 
theory for which a symmetry is spontaneously broken. We therefore hrst consider a simple 
model involving a single complex scalar held, 0. 

1.4.1 A Toy Example 

The lagrangian density: 

C = - d^(t)*d^^(t) - I/(0», 

with ^ ^ 

is invariant with respect to a U{1) group of symmetries: 0 —> e*" 0. This is a global 
symmetry because the term involving derivatives of (j) is only invariant if the symmetry 
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parameter, a, is a constant throughout spacetime. It is called an internal symmetry since 
the symmetry acts only on helds and does not act at all on the spacetime coordinate, x^. 
For later reference, the Noether current for this symmetry is: 

J'm = - 4> . (1-4.2) 


For small A this system is well approximated by a semiclassical expansion, provided 
that the held 0 is O This may be seen by redehning 0 = 0 /V^, and noticing 

that all of the A-dependence then scales out of the lagrangian: £(0, p. A) = y/l(0, p, 1) — 
for which the limit A —0 is seen to be equivalent to h —0 in the semiclassical limit. 

The vacuum of the theory is therefore well described, for small A, by the classical 
conhguration of minimum energy. Since the classical energy density is a sum of positive 
terms, 7^ = 0*0 +V0* ■ V0 + 1^(0*0), it is simple to minimize. The vacuum conhguration 
is a constant throughout spacetime, 0 = V0 = 0, and its constant value, 0 = n, must 
minimize the classical potential; V{v*v) = 0. We may use the U{1) symmetry to choose 
V to be real, and if is positive then the solution becomes v = Happily this 

conhguration lies within the conditions of validity of this semiclassical analysis. 

Since the vacuum conhguration, 0 = n 7 ^ 0, is not invariant under the U{1) transfor¬ 
mations, 0 —> e*"0, the U{1) symmetry is seen to be spontaneously broken. Goldstone’s 
theorem should apply, and so we now identify the Goldstone degree of freedom. 

The spectrum may be identihed by changing variables to the real and imaginary 
parts of the deviation of the held 0 from its vacuum conhguration. Dehning TZ = 
Re (0 — v) and X = \/2 Im 0 diagonalizes the kinetic and mass terms, and the 
scalar potential in terms of these variables becomes: 
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Figure 1.1: The Feynman graphs which describe TZ—I scattering at tree level. Solid lines 
denote TZ and dashed lines represent I. 

where the couplings and masses in this potential are given in terms of the original param¬ 
eters, A and /r, by: 


2 \ 2 5'30 fi'12 An 

^ ^ ’ 3! 2 2^2 


dm _ 9^ _ 922 _ A 

IT “ IT “ T “ 16‘ 


(1.4.4) 


Notice the existence of a massless held, X, as is required by Goldstone’s theorem. We 
can verify that X really is the Goldstone boson by writing the Noether current, eq. ( |1.4.2| ), 
in terms of the mass eigenstates, TZ and X: 


= vV2 d^I + (R -1 S„R). 


(1.4.5) 


Glearly the matrix element: 


(X(p)|j^(x)|0) oc vV2 e 


(1.4.6) 


does not vanish (unless u = 0), as is required of a Goldstone boson. 

A puzzle with the potential of eqs. ( p..4.3|) and ( |1.4.4|) is that the Goldstone boson, 
X, appears in the scalar potential, and so its couplings do not appear to vanish in the 
limit of vanishing momentum. This is only an appearance, however, and X really does 
decouple at low energies, as can be tested by computing Goldstone boson scattering in 
this limit. For example, the S-matrix at tree level for I — TZ scattering may be computed 
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by evaluating the Feynman graphs of Fig. (1.1). The result is: 


S[n{r) +I{s) ^ 7^(r') +X(s')] = 


iA 6^{r + s — r' — s') 
(27r) 2 


with 


(1.4.7) 


A — — g22 + ■ 


912 930 


77 + 9i2 


1 

1 

1 

(s -|- r)2 — ie 

(s - r')2 - ie_ 


{s + s')^ + — ie 

In the limit s^, s'^ —>■ 0 this becomes (using the condition A = = —m 

912 930 ‘^9x2 


A 


922 


mi 


mi 




(1.4.8) 


(1.4.9) 


The scattering amplitude indeed vanishes in the zero-momentum limit, as it must 
according to Goldstone’s theorem. This vanishing is not manifest in the lagrangian, 
however, and is only accomplished through a nontrivial cancellation of terms in the S- 
matrix. For many purposes, not least when constructing an effective theory to describe 
the low energy interactions of the Goldstone bosons, it would be preferable to have this 
decoupling be manifest in the lagrangian. We will now do so, by making a held redehnition 
to a new set of variables for which decoupling becomes explicit. 


1.4.2 A Better Choice of Variables 


In order to identify which variables would make the decoupling of Goldstone bosons 
more explicit in the lagrangian, it is useful to recall the dehnition of what the Goldstone 


mode physically is. Its dehning condition, eq. (|1.3.1|) , can be interpreted to mean that 
the Goldstone modes are obtained from the ground state by performing a symmetry 
transformation, but with a spacetime dependent transformation parameter. 
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In the example considered in the previous section the ground state conhguration is 
(j) = V, and so a local symmetry transformation of this ground-state would be 0 = If 

this is substituted into the lagrangian of eq. ( |1.4.1| ), we hnd = —v'^d^Od^O. 

9 does not drop out of the problem because, although the lagrangian vanishes when it 
is evaluated at 0 = n, the conhguration is only related to 0 = n by a symmetry 

when 6^ is a constant. This fact that 9 parameterizes a symmetry direction when it is 
restricted to constant held conhgurations guarantees that any ^-dependence of C must 
involve at least one derivative of 9, thereby dropping out of the problem in the limit of 
small derivatives — i.e. small momenta, or long wavelengths. 

All of this suggests that 9 would make a good representation for the Goldstone 
mode, since this is precisely what a Goldstone mode is supposed to do: decouple from 
the problem in the limit of small momenta. We are led to the suggestion of using polar 
coordinates in held space, 

(j){x) = x(x) (1.4.10) 

in order to better exhibit the Goldstone boson properties. In this expression both 9 and 
X are dehned to be real. Substituting this into the lagrangian gives: 

-V(X^)- (1.4.11) 

It is clear that these variables do the trick, since the fact that 9 appears in the dehnition, 
eq. (|1.4.10|) , in the same way as does a symmetry parameter guarantees that it completely 
drops out of the scalar potential, as must a Goldstone boson if its low-energy decoupling 
is to be made explicit. 

A price has been paid in exchange for making the low-energy decoupling of the 
Goldstone boson explicit, however. This price is most easily seen once the helds are 
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canonically normalized, which is acheived by writing y = n + ^ y' and 6 = cp. 
With these variables the lagrangian is seen to have acquired nominally nonrenormalizable 
interactions: 

Of course, the S'-matrix for the theory in these variables is identical to that derived from 
the manifestly renormalizable lagrangian expressed in terms of the variables TZ and X. So 
the S'-matrix remains renormalizable even when computed using the variables y' and p. 
(The same is not true for off-shell quantities like Green’s functions, however, since the 
renormalizability of these quantities need not survive a nonlinear held redehnition.) 

In this toy model there is therefore a choice to be made between making the la¬ 
grangian manifestly display either the renormalizability of the theory, or the Goldstone 
boson nature of the massless particle. Which is best to keep explicit will depend on which 
is more convenient for the calculation that is of interest. Since, as we shall see, renormal¬ 
izability is in any case given up when dealing with effective low-energy held theories, it is 
clear that the variables which keep the Goldstone boson properties explicit are the ones 
of choice in this case. 

1.4.3 The General Formulation 

The reason why the above redehnition works may be seen by asking how the U{1) symme¬ 
try acts on the new variables. The key observation is that the symmetry transformation 
becomes inhomogeneous-. 6 ^ 6 a, where a is a constant. In terms of the canonically 
normalized held, p, this transformation law becomes: 

p^p-{-V2va. (1.4.13) 
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This kind of transformation rule is the hallmark of a Goldstone boson, since it enforces 
the explicit nature of all of the Goldstone boson properties in the lagrangian. In fact — 
as can be expected from the generality of Goldstone’s theorem — they can all be derived 
purely on the grounds of this symmetry transformation, and do not rely at all on the 
details of the underlying model which motivated its consideration. 

To show that this is true, imagine writing an arbitrary effective theory for a real 
scalar held, (p, subject only to the symmetry of eq. ( p..4.13|) (and, for simplicity, to Poincare 
invariance). The most general lagrangian which is invariant under this transformation is 
an arbitrary function of the derivatives, of the held. An expansion in interactions of 
successively higher dimension then gives: 




(1.4.14) 


where we have inserted a power of v as appropriate to ensure that the parameter a is 
dimensionless. This accords with the expectation that it is the symmetry-breaking scale, 
n, which sets the natural scale relative to which the low energy limit is to be taken. In 
the toy model just considered, integrating out the heavy held, x' produces these powers 
of V through the appearance of the inverse of the heavy mass, rriji. The result must be an 
ehective lagrangian of the form of eq. (|1.4.14|) , but with a specihc, calculable coefficient 
for the parameter a. 

This, most general, lagrangian automatically ensures that ip has all of the Goldstone 
boson properties. For instance, since the symmetry implies that /I®® can only depend on 
derivatives of </9, it ensures that p cannot appear at all in the scalar potential, and so in 
particular ensures that p is massless. Similarly, applying Noether’s theorem to the kinetic 
term for p implies that there is a contribution to the Noether current, j^, which is linear 
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in ip: 

f = V2v Wip + -{d''ipdup>)d^^ip + --] . (1.4.15) 

L V \ 

The ellipses in this expression represent contributions to which come from other terms 
in the lagrangian besides the ip kinetic term. Clearly this ensures that the matrix element 
(G|j^|0) 7 ^ 0 so long as V 7 ^ 0. 

Such an understanding of the Goldstone nature of a held, like ip, as an automatic 
consequence of a symmetry is clearly invaluable when constructing effective lagrangians for 
systems subject to spontaneous symmetry breaking. We next turn to the generalization 
of these results to the more general case of nonabelian internal symmetries. 

1.5 Nonabelian Internal Symmetries 

The lesson learned from the abelian example is that half of the art of constructing effective 
lagrangians for Goldstone bosons lies in the choice of a convenient set of variables in terms 
of which their properties are explicitly displayed in the lagrangian. In this section the 
above construction is generalized to the case of nonabelian, global, internal symmetries. 

1.5.1 A Second Toy Model 

As guidance towards an appropriate choice of held variables we once more start with a 
simple toy model for which the underlying theory is explicitly known. Gonsider, therefore, 
a system of N real scalar helds, 0*,? = l,...,iV, which for simplicity of notation we 
arrange into an iV-component column vector, denoted by 0 with no superscript. Notice 
that there is no loss of generality in working with real helds, since any complex helds could 
always be decomposed into real and imaginary parts. We take as lagrangian density 
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where the superscript ‘T’ denotes the transpose, and where V^(0) is a potential whose 
detailed form is not required. The kinetic term of this lagrangian is manifestly invariant 
under the Lie group, 0{N), of orthogonal rotations amongst the N real helds: 0 — 0(j), 
where the O’s are independent of spacetime position, <9^0 = 0, and 0^0 = 1. In general, 
the potential V{(j)) need not be also invariant under these 0{N) transformations, but may 
only preserve some subgroup of these, G C 0{N). That is, if g E G, then V^gcj)) = 1^(0) 
for all helds 0 . 

Suppose now that for some regime of parameters this model is well described by 
the semiclassical approximation, and further that the potential, V, is minimized for some 
nonzero value for the helds: 0 = n 7 ^ 0. If this is the case, then the symmetry group G 
may be spontaneously broken to some subgroup, H <Z G, which is dehned by: hv = v, for 
all heH. 

1.5.2 A Group-Theoretic Aside 

It is important to notice that the current whose existence is guaranteed by Noether’s 
theorem — and so which plays the central role in Goldstone’s theorem — arises only if 
the symmetry of interest is continuous. Continuous here means that the group elements 
may be parameterized by a continuous parameter (like a rotation angle), as opposed 
to a discrete label. Gronps with continnous labels are called Lie groups provided their 
labels are sufficiently smooth. Before proceeding it is usefnl to pause to record some 
mathematical properties of such Lie groups, and their associated Lie algebras. 

1. Typically the continuous symmetry groups which arise in physical applications do 
so as explicit hnite-dimensional nnitary matrices. As a resnlt a special role is played 
by compact gronps, for which the parameter space of the gronp is a compact set. 


19 



Compact groups are of such special interest since it is only for compact groups that 
hnite-dimensional, unitary and faithful matrix representations existWe assume 
compact groups throughout what follows, and we work explicitly with representa¬ 
tions involving hnite-dimensional and unitary matrices, = g~^. 


2. There is also no loss of generality in assuming our representation matrices, 5 ^, to be 
real: g = g*. This is because any complex representation may always be decomposed 
into its real and imaginary parts. This convention is ensured in the scalar-held 
example we are considering by choosing to employ only real helds. We do not 
assume these matrices to be irreducible. Recall that if the matrices are reducible, 
then there is a basis in which they can be written in a block-diagonal form: 


f9ii) 

' 9{n) 


(1.5.2) 


3. It is useful to phrase much of what follows in terms of the Lie algebra of G and H 
rather than in terms of the Lie groups themselves. That is, we take advantage of 
the fact that any group element which is connected to the identity element, g = 1, 
may be written as a matrix exponential: g = exp [ia°'Ta], of a linear combination 
of a collection of basis matrices, or generators, T^, a = 1, ...,d where d is called the 
dimension of the group. The T^’s lie inside what is called the Lie algebra of G. The 
unitarity and reality of the group elements, g, imply the matrices Ta to be hermitian 
and imaginary: 

n = n* = -t: = -r;. ( 1 . 5 . 3 ) 

representation is faithful if there is a one-to-one correspondence between the group elements and 
the matrices which represent them. Since the groups of interest are usually defined by a finite-dimensional 
and unitary representation, this representation is, by definition, faithful. 
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4. Since the generators, T^, are finite dimensional and hermitian, it follows that the 
matrix Nab = Tr(TaTfe) is positive definite. As a result we are free to redefine the 
generators to ensure that Nah = ^ab- With this choice there is no distinction to be 
made between indices a and b which are superscripts and subscripts. We assume 
this convenient choice to have been made in what follows. 

5. Closure of the group multiplication law — i.e. the statement that gi, g 2 ^ G implies 
fi'ifl '2 G G —implies commutation relations for the T^’s: TaTb — TbTa = iCabdTd 
where the c^m’s are a set of constant coefficients which are characteristic of the 
group involved. From its definition it is clear that Cabd is antisymmetric under 
the interchange of the indices a and b. Whenever the generators are chosen so 
that Nab = bab it also turns out that Cahd is completely antisymmetric under the 
interchange of any two indices. 

6 . For the present purposes it is convenient to choose a basis of generators which 
includes the generators of the subgroup FT as a subset. That is, choose {Tg} = 
{ti^Xa}, where the fj’s generate the Lie algebra of FT, and the Xq,’s constitute the 
rest. Since H is defined as the group which preserves the vacuum configuration, 
its generators must satisfy Uv = 0. The closure of the subgroup, FF, under multi¬ 
plication ensures that titj — tjti = icijk tk (with no X^’s on the right-hand-side). 
Schematically this can be written Cija = 0. 

7. The Xo’s do not he within the Lie algebra of FF, and so satisfy X^v ^ 0. They are 
said to generate the space, G/H, of cosets. A coset is an equivalence class which is 
defined to contain all of the elements of G that are related by the multiplication by 
an element of FF. Physically, the X^’s represent those generators of the symmetry 
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group, G, which are spontaneously broken. 


8. The group property of H described above, together with the complete antisymmetry 
of the CabdS implies a further condition: Ciaj = 0. This states that tiXa — XaU = 
iCiafdXg (with no tj’s on the right-hand-side). This states that the X^^’s fall into 
a (possibly reducible) representation of H. Once exponentiated into a statement 
about group multiplication, the condition tX — Xt oc. X implies, for any h E H, 
that hXah~^ = La^Xp for some coefficients, La^. 

By contrast, X^Xp — XpXa need not have a particularly simple form, and can be 
proportional to both X.^’s and 

1.5.3 The Toy Model Revisited 

Returning to the toy model dehned by the lagrangian, eq. (EO). we know Goldstone’s 
theorem implies that the assumed symmetry-breaking pattern must give rise to a collec¬ 
tion of massless Goldstone bosons, whose interactions we wish to exhibit explicitly. The 
Goldstone modes are, intuitively, obtained by performing symmetry transformations on 
the ground state. Since an inhnitesimal symmetry transformation on the ground state 
corresponds to the directions XaV in held space, we expect the components of (j) in this 
direction, v'^Xa<p, to be the Goldstone bosons. It is indeed straightforward to verify that 
the G-invariance of the lagrangian ensures the masslessness of these modes. There is 
consequently precisely one Goldstone degree of freedom for each generator of G/H. 

More generally, in order to make low-energy decoupling of these Goldstone bosons 

manifest we require that they do not appear at all in the scalar potential. Following the 

'*If the right-hand-side of Xa Xp — Xp X^ were assumed not to contain any X^'s, then the coset G/H 
would be called a symmetric space, but we do not make this assumption here. 
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example taken for the case of the abelian symmetry, we therefore change variables from 
(p{x) = {0*} to x(a;) = {x'^} and 6{x) = where 


^ = U(ff) X, 


( 1 . 5 . 4 ) 


and U{6) = exp[i0“(a:)Xa] is a spacetime-dependent symmetry transformation in the 
direction of the broken generators, X^,- 

In order for eq. ( |1.5.4|) to provide a well dehned change of variables, x must satisfy 
some kind of constraint. We therefore require that y be perpendicular (in held space) to 
the Goldstone directions, XaV. That is: 


v'^XaX = 0, for all and Xa 


( 1 . 5 . 5 ) 


Notice that this constraint — together with the identity v'^X^v = 0, which follows from 
the antisymmetry of the Xq’s — is precisely what is required to ensure the vanishing of 
the cross terms, proportional to d^d^d^x''^, in the quadratic part of the expansion of the 
kinetic terms about the ground state conhguration: y = u + yk 

Since U{d) is an element of G, the variable 9 is guaranteed to drop out of the scalar 
potential. Of course, this is the point of this change of variables, and it happens because 
G-invariance requires the potential to satisfy V{Ux) = Vix)- As a result, all of the terms 
in C which involve the Goldstone bosons, 9, vanish when = 0, and eqs. ( [1.5.4|) and 
(|1.5.5|) dehne the change of variables which makes explicit the low-energy Goldstone-boson 
decoupling. 


We pause now to briefly argue that it is always possible to satisfy eq. ( p..5.5|) starting 
from an arbitary smooth held conhguration, 0(x). That is, we argue that it is always 
possible to hnd a spacetime dependent group element, U{9) G G, for which y = 
satishes eq. (|1.5.5|) . To this end consider the following function, X[g{a)] = v'^g{a)(f), where 
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g{a) is an arbitrary, spacetime-dependent element of G. Focns, for a moment, on T as 
a fnnction of the parameters, of the gronp for a hxed spacetime position, x^. Since 
all of the variables, 0, v and g have been chosen to be real, and since the gronp, G, is 
compact, T{a) dehnes a real-valned fnnction having a compact range. It is a theorem 
that any snch fnnction must have a maximum and a minimum, and so there exist group 
elements, g = g{a), for which vanishes. Repeating this condition for each 

point in spacetime dehnes functions a“(a:) whose smoothness follows from the assumed 
smoothness of 0(a:). 

The hnal step in the argument is to show that the existence of these stationary 
points of T also give solutions to the problem of hnding a V for which y = satishes 

eq. (|1.5.5|) . This last step follows by explicitly taking the derivative of g with respect 
to and using parameters a such that {dg/da°‘)g~^ = Ta- In this case the vanishing 
of dJ^/da°', when evaluated at g = g = g(a), implies v’^Tagcj) = 0. We see that the 
choice X = with U = g~^, therefore satishes eq. (|1.5.5|) , and the existence of such 

a solution follows from the existence of a maximum and minimum of JF. 

This concludes the argument. We next explore the properties of the new variables. 


1.5.4 The Nonlinear Realization 


Having motivated this choice of variables, we now determine how they transform under the 
group G of symmetry transformations. The transformation rules which we obtain — and 
which we show to be unique, up to held redehnitions, below — carry all of the information 
concerning Goldstone boson properties, and so requiring low-energy lagrangians to be 
invariant under these transformations automatically encodes these properties into the 
low-energy theory. 
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The starting point is the transformation rule for 0: cf) ^ (p = gcp, where g = 
exp[ia“Ta]. The transformation rule for the new variables is then 6^6 and x ^ 
where p = U{6)x and p = U{9)x- That is, under any transformation, g & G, 9, x, 9 and 
X are related by: 

gU{9)x = U{9)x. (1.5.6) 

This last equation states that the matrix 7 = U~^gU (where U denotes U{9)) has 
the property that 77 = y. The central result to be now proven is that this condition 
implies that 7 must lie within the subgroup, H of unbroken transformations, and so may 
be written 7 = exp(mTi), for some function u’‘{9,g). Once this has been demonstrated, 
the transformation law therefore becomes: 


^ »“(», g) and x ^ x(^. 9. \'). 


(1.5.7) 


where 


g _ ^ie°‘Xa 


X = 


(1.5.8) 


The hrst of the eqs. (|1.5.8| ) should be read as dehning the nonlinear functions 0"(0, g) 
and u^{9,g). They are dehned by hnding the element, ge^^'^ G G, and then decomposing 
this matrix into the product of a factor, e®'^, lying in G/H times an element, in H. 
The second line of eqs. (p..5.8|) then dehnes the transformation rule for the non-Goldstone 
helds, y. 

These rules generally dehne transformation laws which are nonlinear in the Gold- 
stone helds, 0". They furnish, nonetheless, a faithful realization of the symmetry group G, 
in that 9{9, gig 2 ) = 9{9{9, 5 ^ 2 ), gi) etc.. This may either be directly verihed using the deh- 
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nitions of eqs. (|1.5.8|) , or by noticing that this property is inherited from the faithfulness 
of the original linear representation of G on 0. 


There is a particularly interesting special case for which eqs. ([1.5.8|) can be explicitly 
solved for 7 = * and U = This is when g = h lies in H. In this case, the solution 

is easily seen to be: 7 = h and U = hUh~^. Both y and 6 therefore transform linearly 
under the unbroken symmetry transformations, H. That is: 




X 


= h h-\ 
X = hx, 


(1.5.9) 


for all h E H. 

For the broken symmetries, g G G/H, it is useful to specialize to an inhnitesimal 
transformation, g = 1 + ioj°‘Xa + ■ ■ ■. In this case we have 7 = 1 + + • ■ •, 

and U{6) = U{6)[1 + iA°'{6,uj)Xa + ■■■], where u'^{6,u) and A(6',a;) must both also be 


inhnitesimal quantities. Eq. ( 1.5.8 ) gives them explicitly to be: 


A„ = Tr 


X^e-^^'^(u-X)e 


i6-X 


— Cal3'yUJ^9'^ + 0 ( 0 ^); 


Ui = Tr 


c,o^gu^9^ + 0{9^) 


(1.5.10) 


(1.5.11) 


where we used: Tr(AaA^) = Sag, Tr(tjtj) = 6ij and TrlgtiXa) = 0 . 

This last expression for Aa{9,uj) can be re-expressed in terms of the change, 50“ = 
^“( 0 , 0 ;) = 0“ — 0", of the Goldstone-boson helds. The relation between A^ and is 
linear: Aq = Mag{9)^^, where the matrix, Mag, of coefficients may be computed using 
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the following useful identity, which holds for any two square matrices, A and B: 

e-^A e^A+B) ^ l + i ds 

Jo 

= 1 + i + (1.5.12) 

Jo 

Using this with A = 6 ■ X and B = ^ ■ X gives: 

M^(3= Trfx„e-“Xfle**®'^l. (1.5.13) 

Jo L J 

The transformation rules for the d" with respect to the broken symmetries in G/H 
have two important properties. The first crucial property is that the transformation law 
is inhomogeneous in the broken symmetry parameters, since 

59'^ = uj'^ + 0{e^). (1.5.14) 

As was observed earlier for the abelian example, it is this property which enforces the 
decoupling of the Goldstone bosons at low energies. 

The second important property is that, for a nonabelian group, the symmetries 
in G/H act nonlinearly on the helds d". This property is also signihcant since it ruins 
many of the consequences which would otherwise hold true for symmetries which are 
linearly realized. For example, the masses of those particles whose helds lie in a linear 
representation of a symmetry group necessarily have equal masses, etc.. The same is not 
true for particles whose helds are related by nonlinear transformations. 

There is a corollary which follows from the nonlinearity of the realization of the sym¬ 
metries in G/H. The fact that the transformation of d" and both held dependent 

implies that the action of these symmetries are spacetime dependent. For example, even 
though the transformation parameters themselves, cu", are constants — since G is a global 
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symmetry — the transformation matrix 7 = which appears in the y transformation 
law is not a constant, d^'-j 7 ^ 0. This fact complicates the construction of lagrangians 
which are invariant with respect to these symmetries. 

1.6 Invariant Lagrangians 

With the transformation rules for the Goldstone boson helds in hand we may now turn 
to the construction of invariant Lagrangians which can describe their low-energy inter¬ 
actions. The main complication here is in the construction of the kinetic terms, since 
the transformation rules for the helds are spacetime dependent due to their complicated 
dependence on the helds. 

A clue as to how to proceed can be found by reconsidering the toy model of scalar 
helds 0. In this case the kinetic term, proportional to (9^0"^(9^0 is manifestly G invariant. 
It must therefore remain so after performing the change of variables to 9 and y. To 
see how this is comes about, we notice that after the replacement 0 = f/y we have: 
d^(j) = U{dfj,x + U~^d^jUx)- In terms of the new variables, the kinetic term is invariant 
because the combination V^x = 9^lX + U~^d^Ux transforms covariantly: V^x 
It does so because U~^d^U transforms like a gauge-potential: 

u-^d^u ^ ( 1 . 6 . 1 ) 

= 7 {U-^d^U) 7 “^ - (9^7 7 “^ 

More information emerges if we separate out the parts of U~^dyU which are pro¬ 
portional to Xq, from those which are proportional to t* since the inhomogeneous term, 
<9/^7 7 ~^, is purely proportional to 0 . That is, if we dehne: 


U-'d^U = -iA^U + ie“X, 


( 1 . 6 . 2 ) 



then each of these terms transforms separately under G transformations: 


- ^ = 7 7 ' - 9„7 7 \ 

^ *e“(e)X„ = 7 7-‘. (1.6.3) 

We see that the quantity transforms like a gauge potential. For inhnitesimal 
transformations, (7 ~ 1 + iuj^Xa and 'y{9,g) Ri 1 + iu^{9,uj)ti, we have: 

6A^^{9) = d^u\9,u;) - d,kU^{9,u;)A’;{9). (1.6.4) 


Similarly, e"( 6 ') transforms covariantly, with 


KW 


-c“tgu'(e,uj) em. 


(1.6.5) 


In this last expression, the structure constants dehne representation matrices, {Ti)ap = 
Caip, of the Lie algebra of H. These are the same matrices which dehne the representation 
of H that the generators form, and it is important for later purposes to recall that 
these representation matrices need not be irreducible. If this representation is reducible 
then it is possible to dehne more G-invariant quantities with which to build the low energy 
lagrangian than would otherwise be possible. 

If we extract the overall factor of 8^9°^, so that = Al^{9) 8^9°^ and e“ = 
e°'p(9) 8 ^ 99 ^ then the identity, eq. (|1.5.12|) , gives the following expressions for the co¬ 
efficients: 


and 


Aid) = 


ds Tr 


he 


-ise-x 


Xo^e 


ise-x 


]Ciap9d -f 0(6*^), 


( 1 . 6 . 6 ) 


e^p{9) = ds Tt 


XfyC 


-ise-x 


Xpe 


ise-x 
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( 1 . 6 . 7 ) 


^a0 — -Ca0'y0'^ + 0{6^). 

With these tools it is now clear how to build G-invariant couplings among the 0", 
and between the 6'"’s and other helds, such as x from the scalar-held example. 

It is simplest to build self-interactions for the Goldstone bosons. An invariant La- 
grangian density may be built by combining the covariant quantity, e" = in all 

possible iJ-invariant ways. This is simple to do since this quantity transforms very simply 
under G: ■ X ^ ■ X'y~^. 

Derivatives of e" can also be included by differentiating using the covariant derivative 
constructed from 

ef, (1.6.8) 

which transforms in the same way as does e": 

The lagrangian is C{e^, ...), where the ellipses denote terms involving higher 

covariant derivatives. Provided only that this lagrangian is constrained to be globally H 
invariant: 

C{he^h~^, hD^eyh ~^,...) = £(e^, ...), (1.6.9) 

the result is guaranteed to be automatically globally G invariant, as required. For a 
Poincare invariant system, the term involving the fewest derivatives therefore becomes: 

Cqb = --fayV^" -|- (higher-derivative terms). (1.6.10) 

In this expression, positivity of the kinetic energy implies that the matrix must 
be positive dehnite. G-invariance dictates that it must also satisfy fxyc^ia + fa\c^i 0 = 0. 
It was remarked earlier that on general grounds the matrices, £11 out a representation, 
TZ, of the unbroken subgroup H with representation matrices given by ( 7 i)a 0 = Cai 0 , and 
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in terms of these matrices G-invariance requires the vanishing of all of the commutator, 
[Ti, f], for all of the generators, 7^. If this representation, TZ, of H is irreducible then, 
by Schur’s lemma, fap must be proportional to the unit matrix, with positive coefficient: 
fa 0 = Otherwise, if TZ is reducible into n irreducible diagonal blocks, then fajs 

need only be block diagonal, with each diagonal element being proportional to a unit 
matrix: 


faP — 




V 


( 1 . 6 . 11 ) 


F‘^5 


0!.nPn 


for n independent positive constants, F^. We see that the lowest-dimension terms in the 
most general low-energy Goldstone-boson self-coupling lagrangian is parameterizable in 
terms of these n constants. 

If other helds — denoted here collectively by y — also appear in the low-energy 
theory then, since the symmetry H is not broken by the ground state, the fields y must 
also transform linearly under H\ y —»• hx, where the matrices {h} form a (possibly 
reducible) representation of F[. In this case the starting point for inferring the coupling 
of the Goldstone bosons is an arbitrary, globally if-invariant lagrangian: C{x, dfiXi ■ ■ •) 
with C{hx, hd^x^ • • •) = F{x, ■ ■ Oi ^r d^h = 0. This lagrangian will be automatically 
promoted to become G-invariant by appropriately coupling the Goldstone bosons. 

The promotion to G invariance proceeds by assigning to y the nonlinear G-transformation 
rule: y —*• yy, where 7 = 'y{6,g) G 77 is the held-dependent 77 matrix which is dehned 
by the nonlinear realization, eq. (|1.5.8| ). An arbitrary globally 77-invariant y-lagrangian 
then becomes G invariant if all derivatives are replaced by the T'-dependent covariant 
derivative: 5^y ^ D^x = d^,x - iA%x, for which D^x ^ iD^x- 

The general lagrangian therefore becomes: G(e^, y, 77^6^, 77^y,...), where G-invariance 
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is ensured provided only that C is constrained by global H invariance: 


C{he^h ,hx,hD^e^h , h ...) = £(e^, y, .. .)• 


( 1 . 6 . 12 ) 


1.7 Uniqueness 


The previous construction certainly dehnes a G-invariant lagrangian for the interactions 
of the Goldstone bosons which arise from the symmetry-breaking pattern G —> if, given 
the transformation rules which were derived in earlier sections. Our goal in the present 
section is to show that this construction gives the most general such invariant lagrangian. 
That is, we wish to show that the most general lagrangian density which is invariant under 
the transformation rules of eqs. (|1.5.8|) may be constructed using only the quantities e“(6') 
and in addition to any other helds, y. 

We start with a general lagrangian density, C{9,d^9,x,d^x)j involving the helds 
0", x” and their derivatives. We do not include a dependence on second and higher 
derivatives of these helds, but this extension is straightforward to make along the lines 
that are described in this section. It is more convenient in what follows to trade the 
assumed dependence of C on d^9 for a dependence on the combinations e" = d^9^ 

and = Al^{9) 5^6*“. There is no loss of generality in doing so, since any function of 9 
and dfj,9 can always be written as a function of 9, e“ and A^^^. This equivalence is most 
easily seen in terms of the matrix variable U{9) = Any function of 9 and d^9 can 

equally well be written as a function of U and df^U, or equivalently as a function of U 
and U~^dJJ. But an arbitrary function of U~^daU is equivalent to a general function of 


e" and A^^^ as may be seen from expression (|1.6.2|) . 

The condition that a general function, C{9°‘,e^,A’’^,x,dfj,x), be invariant with re- 
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spect to G transformations is: 


d£, dC dC d£ d£ 

= K + ^ +T^5x"+ -T = 0. 

ae“ ^ dAl “ dx^ ^{^^lX'^) 


oe^ 


(1.7.1) 


We first specialize to the special case where the symmetry transformation lies in H: 
g = e®’''* G H. In this case we mnst nse, in eq. (EID, the transformations: 

59“ = Sel = -c“.^i,‘e9, 5^ = 

and 5x” = 9')‘(*iX)“. 5a„x” = 9')'(*A.x)”- (1-7.2) 

Reqniring 5/1 = 0 for all possible transformation parameters, 77®, then implies the following 
identity: 


dC 


c'^ipd^ + c ige^ + c’ikA^ 


dC 


dC 


dC . 


Kuxy 


dC 


i{Ud^xT = Q. (1.7.3) 


de^ ^ de?. ^ ^ dAi. ^ dx^ d{d^x") 

This identity simply states that C mnst be constrncted to be an //-invariant fnnction of 
its argnments, all of which transform linearly with respect to H transformations. 

For the remaining symmetry transformations which do not lie in if, = e®®^'^ G 
G/H, we instead evalnate eq. (|1.7.1|) nsing the following transformations: 

59 " = 54 = a,u‘ - c‘,tn’A% 

and 5%®® = iu^UxT = iu^iUd^xT, (1-7.4) 

where oj^ and n® = ^^(0) a;" are the nonlinear fnnctions of 9 that are dehned 

by eq. ( |1.5.8|) . Using these in eq. ( |1.7.1| ), and simplifying the resnlting expression nsing 
eq. (|1.7.3|) , leads to the remaining condition for G invariance: 


dC / ^ dC 

dQa 1^/3 + 


^ ^d^u^giUxT = 0. (1.7.5) 
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This last identity contains two separate pieces of information. The hrst piece can be 
extracted by specializing to 6*" = 0. In this case, since = daU^p d^9°‘ vanishes when 
9°‘ = 0, and since eq. (|1.5.141 ) implies = 0) = 6 ^, we hnd: 

dC 


89 ^ 


= 0 . 


(1.7.6) 


0=0 


But, since the group transformation law for is inhomogeneous, we may always perform 
a symmetry transformation to ensure that 0“ = 0 for any point p & G/H. As a result, 
eq. (|1.7.6|) implies the more general statement: 

dC 


89 ^ 


= 0 throughout G/H. 


(1.7.7) 


The rest of the information contained in eq. (|1.7.5|) may now be extracted by using 
8C/89°' = 0 to eliminate the hrst term. One hnds the remaining condition: 


8 C 8C „ ,■ 

+ -TT AtjX) 8^U^p = 0. 


(1.7.8) 


\8Ai 8{8f,x'') 

This equation has a very simple meaning. It states that C can depend on the two variables, 
Ajj^ and 8 ^x^j oiily through the one combination: {D^xY = ~ That is, 

X can appear differentiated in C only through its covariant derivative, D^x- 

We see from these arguments that the G-invariance of C is equivalent to the state¬ 
ment that C must be an TT-invariant function constructed from the covariantly-transforming 
variables e“, y and D^x- If higher derivatives of 9 had been considered, then the van¬ 
ishing of the terms in 5C that are proportional to more than one derivative of m* would 
similarly imply that derivatives of e" must also only appear through its covariant deriva¬ 


tive, dehned by eq. (|1.6.8|) . This proves the uniqueness of the construction of 

invariant lagrangians using these covariant quantities. 
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1.8 The Geometric Picture 


There is an appealing geometric description of the resnlting effective lagrangian, which 
makes available many powerfnl techniqnes from differential geometry to effective la¬ 
grangian methods. We panse here to briefly ontline this connection. 

Consider, for simplicity, only the self-interactions of the Goldstone bosons: Cgb{9). 
This can be expanded into terms having increasing nnmbers of derivatives acting on 6 . 
The hrst few terms that are consistent with Poincare invariance are: 

CaB{9) = -V{9) - ^go.fi{9) d^9‘^d^9^ + • • •. (1.8.1) 

Positivity of the kinetic energy for flnctnations abont any conhgnration, 0", reqnires the 
symmetric matrix to be positive dehnite for all 9. 

The geometrical interpretation arises once we recall that the helds 9 take valnes in 
the coset space G/H, and so each Goldstone boson held conhgnration can be considered 
to be a map from spacetime into G/H. The fnnction, V, can then be considered to be a 
real-valned fnnction which is dehned on the space G/H. Similarly, the positive symmetric 
matrix, g^g, dehnes a metric tensor on G/H. These identihcations of V and gag with 
geometrical objects on G/H are consistent with their transformation properties nnder 
held redehnitions, 50" = ^"(0), which are the analognes of coordinate transformations on 
G/H. To see this, perform this transformation in the lagrangian of eq. (|1.8.1|) . The resnlt 
is to replace V and gag by the qnantities V + £.^V and gag + £^gag, where the linear 
operator, £^, is known as the Lie derivative in the direction specihed by and is given 
explicitly for a scalar and a covariant tensor by: 

£^V = CdaV, 

and £^gag = ^^dxgag + gxgda^^ + gaxdg^^. (1.8.2) 
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In these expressions derivatives, like 9^1/, d\gai 3 or all represent differentiation with 
respect to 6*", and not with respect to spacetime position, x^. 

Clearly, the G invariance of the hrst few terms of Cqb therefore becomes eqnivalent 
to the problem of hnding a scalar and a metric for which £^V = £^gai 3 = 0 for each of the 
^“(6*)’s which describe the action of G on G/H. Since every point on G/H can be reached 
from any other by performing such a G transformation — i.e. G/H is a homogeneous 
space — it follows that the only possible invariant function, V, is a constant which is 
independent of 0. This expresses the low-energy decoupling of the Goldstone bosons since 
it shows that G invariance completely forbids their appearance in the scalar potential. 

Similarly, the condition for the G invariance of the kinetic terms is that the metric 
gag must also be invariant under the action of all of the .^“(6')’s which generate G on 
G/H. That is, all of these ^’s must generate isometries of the metric gag- The problem of 
hnding the most general invariant kinetic term is therefore equivalent to constructing the 


most general G-invariant metric on G/H. A comparison of the lagrangian of eq. (|1.8.1| ), 
with our earlier result, eq. (|1.6.1U| ), gives a representation of the metric gag in terms of 
the quantities e°‘g{9). We have 


9ag f'fX OL^ g-i 

n 

r=l 

n 

~ T. Fr + o(eg 

r=l 


(1.8.3) 


Eq. (|1.8.3|) also has a geometric interpretation. It shows that the object e^g can 
be interpreted as a G-covariant vielbein for the space G/H. In Reimannian geometry a 
vielbein is the name given to a set of N linearly independent vectors, e°‘g, a = 1,... ,N, 
which are tangent to an A^-dimensional space. (The name is German for ‘many legs’, with 
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viel meaning ‘many’, and so such vectors are also called zweibein in two dimensions — 
with zwei = ‘two’ — or vierbein in four dimensions — with vier = ‘four’.) Part of the 
utility of identifying such a set of vectors is that it is always possible to reconstruct from 
them the space’s metric, using: = Sab 

The geometrical interpretation of as a vielbein, and the uniqueness of the con¬ 
struction of invariant lagrangians proven in the previous section, gives the general solution 
to the geometrical problem of constructing G-invariant metrics on the space G/H. We 
see that there is an n-parameter family of such metrics, where n counts the number of 
irreducible representations of H which are formed by the generators, Xa, of G/H. The n 
parameters are given explicitly by the constants F/,r = 1,... ,n. 

For many physical applications the representation of H that is furnished by the is 
irreducible, and in this case the G-invariant metric is uniquely determined up to its overall 
normalization: gag = F'^gag, with gag = Sag + O {6). For such systems there is precisely 
one constant in the effective lagrangian for Goldstone bosons which is undetermined by 
the symmetries of the problem, if we include only the fewest possible (two) derivatives: 

F'^ 

Cgb = —— dagiO) d^9‘^d^9^ -1- (higher derivative terms). (1.8.4) 

Once the one constant, is determined, either by calculation from an underlying the¬ 
ory, or by appeal to experiment, the lowest-order form for all of the Goldstone boson 
interactions are completely determined by the symmetry breaking pattern. The resulting 
model-independent predictive power has wide applications throughout physics, as we shall 
see when we consider examples in subsequent chapters. 
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1.9 Nonrelativistic Lagrangians 


Before turning to examples, we pause to outline the changes in the above analysis which 
become necessary when it is applied to nonrelativistic systems, for which the space- 
time symmetry is not Poincare invariance. This is what is appropriate, for example, 
in condensed-matter applications for which there is a preferred frame, dehned by the 
centre-of-mass of the medium of interest. 

So long as our attention is restricted to internal symmetries, most of the considera¬ 
tions of this chapter apply just as well to nonrelativistic problems as they do to relativistic 
ones. In particular, the expressions obtained for the nonlinear realization of broken sym¬ 
metries on the Goldstone boson helds does not depend at all on the spacetime symmetries 
which are assumed. 

We assume for simplicity here that the system remains invariant with respect to 
translations and rotations, although the generalization to different spacetime gronps is 
straightforward in principle. In practice, the resnlts we obtain also apply to some lattice 
systems, for which translations and rotations are not symmetries. This is because, at 
least for the hrst few derivatives in a derivative expansion of the lagrangian, the lattice 
gronp for some lattices, snch as a cnbic lattice for example, implies the same restrictions 
as do translation and rotation invariance. 

There are two cases, which we consider separately below, depending on whether or 
not time reversal is a good symmetry of the system. 

1.9.1 When Time-Reversal is a Good Symmetry 

For nnbroken time reversal, the main difference from the relativistic sitnation lies in the 
fact that the time- and space-derivatives become independent since they are nnrelated 
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by any symmetries. For instance, assuming unbroken translation and rotation invariance, 
the most general self-couplings amongst the Goldstone bosons for the symmetry-breaking 
pattern G ^ H are: 



-|-(higher derivative terms). (1.9.1) 

Compared to the relativistic case, twice as many — that is, 2n — constants, F^r),t 

are required to parameterize the terms containing the fewest number of deriva¬ 
tives. Once these two constants are determined, all other interactions at this order of the 
derivative expansion are clearly completely determined. 

1.9.2 When Time-Reversal is Broken 

New possibilities arise for the lagrangian when time-reversal symmetry is broken, as is 
the case for a ferromagnet, for example. In this case, it is possible to write down terms 
in CcB which involve an odd number of time derivatives. In particular, the dominant, 
lowest-dimension term involving time derivatives involves only one: 

ACob = -Ao^iO) r. (1.9.2) 

The coefficient function, Aa{9), can be considered to dehne a vector held on the coset 
space G/H. It is to be chosen to ensure the G invariance of the low-energy theory. 

If ACqb is required to be invariant under G transformations, then it must be built 
using the quantity e"^(6*) = e°‘f3{9) and its covariant derivatives. If it is to involve 
only a single time derivative, then it must be proportional only to eo(^). But the only such 
G-invariant quantity is: where the constants, ka, must satisfy kaF^ip = 0 for all 

indices i and {3. Such a k^ exists only if the corresponding generator, Xq, commutes with 
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all of the generators of the unbroken subgroup, H. This quite restrictive condition is not 
ever satisfied in many situations of physical interest, and for these systems it appears — 
at least superhcially — that no terms involving only a single time derivative are consistent 
with G-invariance. 

This conclusion would be too strong, however, because it is too restrictive a condition 
to demand the G-invariance of the lagrangian density, jC. We are only required by G 
symmetry to demand the invariance of the action. The lagrangian density need not 
be invariant, provided that its variation is a total derivative. It is therefore worth re¬ 


examining the time-reversal-violating term of eq. ( [1.9.2|) in this light. 

Once we drop total derivatives in ACqb, h is clear that the coefficient Aa(0) is 
only defined up to the addition of a gradient. That is, any two choices Aa and Aa = 
Aa + 5oO(6'), differ in their contribution to ACqs only by the total derivative: 


aCgb{a + on) - ACcBiA) = -dM{ 0 ) 9 = - h{e). 


(1.9.3) 


In geometrical terms we may therefore regard the coefficient function, Ao,{9), as defining 
a gauge potential on the coset space G/H. 

The condition that ACqb contribute a G-invariant term to the action therefore 
only requires the coefficient Aa{6) to be G-invariant up to a gauge transformation. In 
equations, G-invariance of the action only requires: 

(1.9.4) 

for each generator 50" = of the action of G on G/H, and for some scalar functions, 
14^(0), on G/H. This last condition is equivalent to demanding the invariance of the 
gauge-invariant quantity: = d^Ap — dpA^. That is, 

S.'iFag = + Fxpdai^ + Faxdfsf^ = 0. (1.9.5) 
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We shall find that this condition does admit solutions in many cases of interest — 
most notably for the example of a ferromagnet. 

1.10 Power Counting 

Before proceeding to real-life applications, a hnal important issue must be addressed. 
Since the lagrangians expressed using Goldstone boson variables is typically nonrenormal- 
izable, it is necessary to know how to use nonrenormalizable lagrangians when making 
quantitative calculations. 

The key to doing so is to consider the Goldstone Boson lagrangians to which we 
have been led in previous sections to be ‘effective theories’ which describe only the low- 
energy behaviour of the system of interest. For instance, in our toy models the Goldstone 
bosons (0“) are massless while the other degrees of freedom (y) are not. (Although the 
pseudo-Goldstone bosons for an approximate symmetry are not exactly massless, they 
may nonetheless appear in the low-energy theory so long as their mass, m, is sufficiently 
small.) A lagrangian involving only Goldstone bosons or pseudo-Goldstone bosons can 
only hope to describe physics at energies, q, below the mass threshhold, M, for producing 
the heavier particles. (It is often the case that M is proportional to the symmetry-breaking 
scale(s), v). The predictions of such a lagrangian are to be regarded as reproducing, in 
powers of g/M, whatever the ‘underlying’ (or ‘microscopic’) theory — i.e. the theory 
involving the heavy y states — might be. 

In order to make this concrete, consider one such a Lagrangian, having the form: 

= ( 1 . 10 . 1 ) 

where 0 denotes a generic boson held, c„ are a set of dimensionless coupling constants 


41 



which we imagine to be at most 0(1), and /, M and v are mass scales of the underlying 
problem. (For example, in the application to pions which follows we will have / = ^ 

M = Ay. and v = where F^^ and are scales which characterize the strength of the 
appropriate symmetry breaking in the strong interactions.) dn is the dimension of the 
operator On, in powers of mass, as computed by counting only the dimensions of the field, 
0, and derivatives, (90. 

Imagine using this lagrangian to compute a scattering amplitude, AE^q), involving 
the scattering of E particles whose four-momenta are collectively denoted by q. We wish 
to focus on the contribution to A due to a Feynman graph having I internal lines and 
Vik vertices. The labels i and k here indicate two characteristics of the vertices: i counts 
the number of lines which converge at the vertex, and k counts the power of momentum 
which appears in the vertex. Equivalently, i counts the number of powers of the fields, 0, 
which appear in the corresponding interaction term in the lagrangian, and k counts the 
number of derivatives of these fields which appear there. 

1.10.1 Some Useful Identities 

The positive integers, I, E and Vik, which characterize the Feynman graph in question 
are not all independent since they are related by the rules for constructing graphs from 
lines and vertices. This relation can be obtained by equating two equivalent methods of 
counting the number of ways that internal and external lines can end in a graph. On the 
one hand, since all lines end at a vertex, the number of ends is given by summing over 
all of the ends which appear in all of the vertices: J2ik * ^ik- On the other hand, there are 
two ends for each internal line, and one end for each external line in the graph: 21 + E. 
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Equating these gives the identity which expresses the ‘conservation of ends’: 

21 + E = '^i Vik, (Conservation of Ends). (1.10.2) 

ik 

A second useful identity defines of the number of loops, L, for each (connected) 
graph: 

L = 1 + J — ^ Vik, (Dehnition of L). (1.10.3) 

ik 

For simple planar graphs, this dehnition agrees with the intuitive notion what the number 
of loops in a graph means. 

1.10.2 Dimensional Estimates 

We now collect the dependence of .4,^(0) on the parameters in 

Reading the Feynman rules from the lagrangian of eq. (|1.10.1| ) shows that the vertices 
in the Feynman graph contribute the following factor: 


(Vertex) = J([ 

ik 


(i))* 


Vik 


(1.10.4) 


where p generically denotes the various momenta running through the vertex. 

Similarly, each internal line in the graph contributes the additional factors: 

(Internal Line) = (—) . (1.10.5) 

where, again, p denotes the generic momentum flowing through the line, m denotes the 
mass of the light particles which appear in the effective theory, and it is assumed that 
the kinetic terms which dehne their propagation are those terms in involving two 
derivatives and two powers of the helds, 0. 

As usual for a connected graph, all but one of the momentum-conserving delta func¬ 
tions in eq. (|1.10.4|) can be used to perform one of the momentum integrals in eq. (p..l0.5|). 
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The one remaining delta function which is left after doing so depends only on the external 
momenta, S‘^{q), and expresses the overall conservation of four-momentum for the process. 
Future formulae are less cluttered if this factor is extracted once and for all, by dehning 
the reduced amplitude. A, by 

= i{27r)‘^6{q) Asiq). ( 1 . 10 . 6 ) 

The number of four-momentum integrations which are left after having used all of 
the momentum-conserving delta functions is then I — J2ik Vik + ^ = L. This last equality 
uses the dehnition, eq. (|1.10.3|) , of the number of loops, L. 

We now wish to estimate the result of performing the integration over the internal 
momenta. In general these are complicated integrals for which a simple result is not 
always possible to give. Considerable simplihcations arise, however, if all of the masses 
and energies of the particles in the low-energy theory are of the same order of magnitude, 
since in this case much can be said about the order of magnitude of the momentum 
integrals purely on dimensional grounds. (Although this is often the situation of interest 
when employing effective theories, it must be borne in mind that it does not always apply. 
For instance it excludes a situation of considerable practical interest, where the low-energy 
theory includes very massive but slowly-moving, nonrelativistic particles. Power counting 
for such systems is beyond the scope of this review.) 

In order to proceed with a dimensional argument it is most convenient to regulate 
the ultraviolet divergences which arise in the momentum integrals using dimensional reg¬ 
ularization. For dimensionally-regularized integrals, the key observation is that the size 
of the result is set on dimensional grounds by the light masses or external momenta of 
the theory. That is, if all external energies, q, are comparable to (or larger than) the 
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masses, m, of the light particles whose scattering is being calculated, then q is the light 
scale controlling the size of the momentum integrations, so dimensional analysis implies 
that an estimate of the size of the momentum integrations is: 


d^p 


p 


B 


2A 


q 


.nA+B—2C 


(1.10.7) 


(p2 -)- g2^C 

with a dimensionless prefactor which carries all of the complicated dependence on dimen¬ 
sionless ratios like q/m. The prefactor also depends on the dimension, n, of spacetime, 
and may be singular in the limit that n —> 4.Q 

With this estimate for the size of the momentum integrations, we hnd the following 
contribution to the amplitude .4 b (g): 

f f ( d'^P \ ^ pSifc f 


(—) ^ ( 1 . 10 . 8 ) 


J J y(27r)"^y (p2 -|- q'^y 

which, with liberal use of the identities (|1.10.2|) and (|1.10.3|) , gives as estimate for Asiq)- 


Ayq) ~ r 1 ^- 


1 \ ® 


AnP 


q \ 2+2L+yi^.,_(A:—2)Vifc 

mJ 


(1.10.9) 


This expression is the principal result of this section. Its utility lies in the fact that it links 
the contributions of the various effective interactions in the effective lagrangian, ( [1.10.1|) , 
with the dependence of observables on small mass ratios such as q/M. As a result it 
permits the determination of which interactions in the effective lagrangian are required 
to reproduce any given order in g/M in physical observables. 

Most importantly, eq. ( |1.10.9| ) shows how to calculate using nonrenormalizable the¬ 
ories. It implies that even though the lagrangian can contain arbitrarily many terms, and 
so potentially arbitrarily many coupling constants, it is nonetheless predictive so long as 

its predictions are only made for low-energy processes, for which q/M -C 1. (Notice also 
^We ignore here any logarithmic infrared mass singularities which may arise in this limit. 
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that the factor {M /in eq. ( P.-10-9 ) implies, all other things being equal, the scale / 
cannot be taken to be systematically smaller than M without ruining the validity of the 
loop expansion in the effective low-energy theory.) 

Before stating more explicitly the effective-lagrangian logic, which eq. ( |1.1U.9| ) sum¬ 
marizes, we pause to generalize it to include fermions in the low-energy effective theory. 

1.10.3 Including Fermions 

It is straightforward to extend these results to include light fermions in the effective theory, 
although once again subject to the important assumption that all masses and energies 
are small in the effective theory. To this end, hrst generalize the starting form assumed 
for the lagrangian to include fermion helds, Yi addition to boson helds, 0: 


^eff _ rA \ '' /r\ 




„ ’ 3/2 / • 

Vp ) 


( 1 . 10 . 10 ) 


An important difference between fermion and boson propagators lies in the way each 
falls off for large momenta. Whereas a boson propagator varies like 1/p^ for large p, a 
fermion propagator goes only like 1/p. This leads to a difference in their contribution to 
the power counting of a Feynman graph. It is therefore important to keep separate track 
of the number of fermion and boson lines, and we therefore choose to now label vertices 
using three indices: /c, ip and ip. As before, k labels the numbers of derivatives in the 
corresponding interaction, but now ip and ip separately count the number of bose and 
fermi lines which terminate at the vertex of interest. The number of vertices in a graph 
which carry a given value for fc, ip and ip we now label by Vigi^k- 

Consider now computing an amplitude which has Eg external bosonic lines, Ep 
external fermion lines, and Ip and Ip internal bose and fermi lines. Repeating, with the 
lagrangian of eq. (|1.10.10|) , the power counting argument which led (using dimensional 
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regularization) to eq. (|1.10.9 ) now gives instead the following result: 



1 \ 


\ ‘iEp (2 



2L 


P 


~^EB,Ep{q) ~ 


( 1 . 10 . 11 ) 


where the power P can be written in either of the following two equivalent ways: 



( 1 . 10 . 12 ) 


1.11 The Effective-Lagrangian Logic 

The powercounting estimates just performed show how to organize calculations using 

nominally nonrenormalizable theories, considering them as effective held theories. They 

suggest the following general logic concerning their use. 

Step I Choose the accuracy {e.g. one part per mille) with which observables, such as 
Asiq), are to be computed. 

Step II Determine the order in the small mass ratios q/M or m/M that must be required 
in order to acheive the desired accuracy. 

Step III Use the power counting result, eq. (|1.10.9|) , to hnd which terms in the effective 
lagrangian are needed in order to compute to the desired order in q/M. Eq. (|1.10.9|) 
also determines which order in the loop expansion is required for each effective 
interaction of interest. 

Step IVa Compute the couplings of the required effective interactions using the full 
underlying theory. If this step should prove to be impossible, due either to ignorance 
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of the underlying theory or to the intractability of the required calculation, then it 
may be replaced by the following alternative: 


Step IVb If the coefficients of the required terms in the effective lagrangian cannot be 
computed then they may instead be regarded as unknown parameters which are 
to be taken from experiment. Once a sufficient number of observables are used to 
determine these parameters, all other observables may be unambiguously predicted 
using the effective theory. 

A number of points bear emphasizing at this point. 

1. The possibility of treating the effective lagrangian phenomenologically, as in Step 
IVb above, immeasurably broadens the utility of effective lagrangian techniques, 
since they need not be restricted to situations for which the underlying theory is 
both known and calculationally simple. Implicit in such a program is the underlying 
assumption that there is no loss of generality in working with a local held theory. 
This assumption has been borne out in all known examples of physical systems. It 
is based on the conviction that the restrictions which are implicit in working with 
local held theories are simply those that follow from general physical principles, such 
as unitarity and cluster decomposition. 


2. Since eq. (|1.10.9|) — or eqs. ( |I.10.11| ) and (|1.10.12|) — states that only a hnite 
number of terms in contribute to any hxed order in q/M, and these terms need 
appear in only a hnite number of loops, it follows that only a hnite amount of labour 
is required to obtain a hxed accuracy in observables. 


Renormalizable theories represent the special case for which it suffices to work only 
to zeroeth order in the ratio q/M. This can be expected to eventually dominate 
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at sufficiently low energies compared to M, which is the reason why renormalizable 
theories play such an important role throughout physics. 

3. An interesting corollary of the above observations is the fact that only a hnite 
number of renormalizations are required in the low-energy theory in order to make 
hnite the predictions for observables to any hxed order in q/M. Thus, although an 
effective lagrangian is not renormalizable in the traditional sense, it nevertheless is 
predictive in the same way that a renormalizable theory is. 


49 



Chapter 2 

Pions: A Relativistic Application 


We now present a relativistic application of these techniques to the low-energy interac¬ 
tions of pions and nucleons. This example provides a very useful, and experimentally 
successful, description of the low-energy limit of the strong interactions, and so illustrates 
how effective lagrangians can remain predictive even if it is impossible to predict their 
effective couplings from an underlying theory. This example is also of historical inter¬ 
est, since the study of low-energy pion scattering comprises the context within which the 
above Goldstone-boson formalism initially arose. 

2.1 The Chiral Symmetries of QCD 

The modern understanding of the strong interactions is based on the theory of mutually 
interacting spin-half quarks and spin-one gluons that is called Quantum Chromodynamics 
(QCD). This theory is described by the lagrangian density 

£000 = - jG:.Gr - (2.1.1) 

^ n 

where + gf\, G^^gl is the held strength tensor for the gluon helds. 

Here a = 1,..., 8 labels the generators of the gauge symmetry group of the theory. 
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SUc{3), for which the are the structure constants. The subscript ‘c’ of SUc{3) stands 
for ‘colour’, which is the name given to the strong charge. 

The quarks are represented by Dirac spinors, where n = 1, • • •, 6 counts the six 
kinds of quarks. In order of increasing mass, these are: u, d, s, c, b and t. For the purposes 
of later comparison we list here the quark masses, mg„, in GeV: = 0.0015 — 0.005, 

rud = 0.003 — 0.009, rus = 0.06 — 0.17, rric = 1.1 —1.4, rrib = 4.1—4.4 and rrit = 173.8±5.2. 
All of these quarks are assumed to transform in the three-dimensional representation of 
the colour symmetry group, SUc{3), and so their covariant derivative (which appears in 
the combination .p = in the lagrangian) is: D^qn = d^Qn — The eight 

matrices, Aa, denote the three-by-three Gell-Mann matrices, which act on the (unwritten) 
colour index of each of the quarks. The explicit form for these matrices is not required in 
what follows. In all of these expressions g represents the coupling constant whose value 
controls the strength of the quark-gluon and gluon-gluon couplings. 

The strong interactions as given by the above lagrangian density are believed to 
bind the quarks and gluons into bound states, which correspond to the observed strongly 
interacting particles (or, hadrons), such as protons (p), neutrons (n), pions (tt), kaons 
{K), etc.. Table 2.1 lists the masses and some of the quantum numbers for all of the 
hadrons whose masses are less than 1 GeV. Gonsiderably more states have masses above 
1 GeV. 

For the present purposes the most signihcant feature about this particle spectrum is 
that the lightest two quarks, u and d, have masses which are much smaller than all of the 
masses of the states which make up the particle spectrum. This suggests that the QGD 
dynamics may be well approximated by taking mu,md ~ 0, and working perturbatively 
in these masses divided by a scale. A, which is typical of the strong interactions. From 
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Particle 

Quark Content 

Mass (GeV) 

Spin 

Isospin 

TT” (tT^) [tT®] 

duiud) [uu, dd] 

0.140 [0.135] 

0 

1 

7r° 

uu, dd 

0.135 

0 

1 

K+{K^) 

us ids) 

0.494 (0.498) 

0 

1 

2 


suisd) 

0.494 (0.498) 

0 

1 

2 

V 

uu, dd, ss 

0.547 

0 

0 


duiud) [uu, dd\ 

0.770 

1 

1 

u 

uu, dd, ss 

0.782 

1 

0 

K*+{K*^) 

us ids) 

0.892 (0.896) 

1 

1 

2 


suisd) 

0.892 (0.896) 

1 

1 

2 

rj' 

uu, dd, ss 

0.958 

0 

0 

fo 

uu, dd, ss 

0.980 

0 

0 

Oq 

uu, dd, ss 

0.980 

0 

1 

pin) 

uudiddu) 

0.938 (0.940) 

1 

2 

1 

2 


Table 2.1; Masses and Quantum Numbers of the Lightest Hadrons 


the observed bound-state spectrum we expect A to be 
The approximation for which rriu and rud vanish 
This is because the QCD lagrangian acquires the very 


roughly 1 GeV. 

turns out to be a very useful one. 
useful symmetries 

(“). ( 2 . 1 . 2 ) 


in this limit, where and Uu are arbitrary two-by-two unitary matrices having unit 
determinant. The Dirac matrix 7 ^ = i(l-|- 75 ) projects onto the left-handed part of each of 
the quarks, u and d, while 7 ^ = 1(1 — 75 ) projects onto their right-handed part. The group 
of symmetries which is obtained in this way is G = SUl{2) x SUii{2), with the subscripts 
‘L’ and ‘i?’ indicating the handedness of the quarks on which the corresponding factor 
of the group acts. A symmetry such as this which treats left- and right-handed fermions 
differently is called a chiral symmetry. These transformations are exact symmetries of 
QCD in the limit of vanishing niu and but are only approximate symmetries when 
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these masses take their real values. Because the approximate symmetry involved is chiral, 
the technique of expanding quantities in powers of the light-quark masses is called Chiral 
Perturbation Theory. 

If this symmetry, G, were not spontaneously broken by the QCD ground state, 
|r2), then we would expect all of the observed hadrons to fall into representations of G 
consisting of particles having approximately equal masses. This is not seen in the spectrum 
of observed hadrons, although the known particles do organize themselves into roughly 
degenerate representations of the approximate symmetry of isospin: SUj{2). The isospin 
quantum number, I, for the observed SUi{2) representations of the lightest hadrons are 
listed in Table 2.1. (The dimension of the corresponding representation is 2/-|-1.) Isospin 
symmetry can be understood at the quark level to consist of the diagonal subgroup of G, 
for which Ul = Un- That is, the approximate symmetry group which is seen to act on 
the particle states is that for which the left- and right-handed components of the quarks 
u and d rotate equally. 

This suggests that if QCD is to describe the experimentally observed hadron spec¬ 
trum, then its ground state must spontaneously break the approximate symmetry group 
G down to the subgroup H = SUi{2), for which: 



There is indeed good theoretical evidence, such as from numerical calculations, that the 
ground state of QCD really does behave in this way. 

Given this symmetry-breaking pattern, we know that the low-energy spectrum of 
the theory must include the corresponding Goldstone bosons. If G had been an exact 
symmetry, then the corresponding Goldstone bosons would be exactly massless. But 
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since G is only a real symmetry in the limit that and rrid both vanish, it follows 
that the Goldstone bosons for spontaneous G breaking need only vanish with these quark 
masses. So long as the u and d quarks are much lighter than the natural scale — A 1 
GeV — of the strong interactions, so must be these Goldstone bosons. Indeed, the 
lightest hadrons in the spectrum, tt^ and tt'^, have precisely the quantum numbers which 
are required for them to be the Goldstone bosons for the symmetry-breaking pattern 
SU,{2) y. SU,{1) ^ Sf/,(2).[| Particles such as these which are light, but not massless, 
because they are the Goldstone bosons only of an approximate symmetry of a problem 
are called pseudo-Goldstone bosons. 

Since the low-energy interactions of Golstone bosons are strongly restricted by the 
symmetry-breaking pattern which guarantees their existence, it is possible to experimen¬ 
tally test this picture of pions as pseudo-Goldstone bosons. The remainder of this chapter 
is devoted to extracting some of the simplest predictions for pion interactions which can 
be obtained in this way. The fact that these predictions successfully describe the low- 
energy interactions of real pions gives support to the assumed symmetry-breaking pattern 
for the ground state of the strong interactions. 

2.2 The Low-Energy Variables 

In order to proceed, we must hrst construct the nonlinear realization for the case G = 
SUl{2) X SUii{2) and H = SUj{2). To do so, we hrst write out the representation we 

shall use for the elements of each of these groups. Denoting the Pauli matrices, r = {rn}, 

^In fact, the next-lightest particles, K and ry, together with the pions have the quantum numbers to 
be the Goldstone bosons for the pattern SUl{3) x 5'17 b (3) ^ SUv{3), which would be appropriate in the 
limit that the lightest three quarks, u, d and s, were all massless. The unbroken subgroup here, SUv{3), 
again is the diagonal, handedness-independent, subgroup. 
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by Ti = 


01 \ fO -i\ j fl 0 

1 oj“‘^’‘==U -1 


, we write: 


9 = 


62 


lul-t 


62 


LJu-T 


e SU,{2) X SU^{2), 


( 2 . 2 . 1 ) 


and 


''=r’r 


( 2 . 2 . 2 ) 

We adopt here, and throughout the remainder of the chapter, an obvious vector notation 
for the three-component quantities 6 ^”, u^, etc.. 

In this representation, the Goldtone boson field becomes: 


u{e) = 


62 


^^e-r 0 


0 6 2 


-i0-r ’ 


(2.2.3) 


and the nonlinear H transformation, 7 , is: 


l{0,g) = 


62 ' 


0 62“-" 


(2.2.4) 


2.2.1 A Notational Aside 


Before passing to the nonlinear realization, we briefly pause to make contact between 
the variables as dehned here, and those that are often used in the literature. We have 
dehned the elements, g & G, the matrices U{6), and 7 ( 0 , g) in a block-diagonal form which 
emphasizes the left- and right-handed parts of the transformations: 

f gL 0\ f Uri6) 0 \ N (h{9,q) 0 


9 = 


V 0 / 


(2.2.5) 


VO Vo uM)' V 0 h{e,g) 

In terms of these quantities, the transformation law U ^ U = gU'y'^ becomes 
Ul = 9 LULh^ and ^ = gnUnh^. It is common practice to work with the composite 

quantity, S, for which the transformation rule does not depend on the implicitly-dehned 
matrix h. That is, if S = then S —S = gj^Sg]^. This transformation law has the 
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advantage of involving only explicit, constant matrices. In terms of the Goldstone boson 
fields, 9, we have = mJj, so S = 

It is possible, and often convenient, to reformulate all of the Goldstone boson self¬ 
couplings that are obtained elsewhere in this chapter in terms of this held S. It is not 
possible to express the Goldstone-boson couplings to other helds, y, in this way since the 
matrix 7 cannot be removed from the transformation law for these other helds. 

2.2.2 The Nonlinear Realization 

The nonlinear realization is now obtained by constructing both 6{6,g) and u = u{6,g), 
using the condition g U{9) = U{9) 7 . For the groups under consideration this construction 
may be performed in closed form by using the identity: 


exp id ■ f = cos q; 4 - id; ■ r sin a. 


( 2 . 2 . 6 ) 


where a = Vd ■ a, and d = d/a. 

Using this identity to multiply out both sides of the dehning equation g U{9) = 
U{9) 7 , and equating the coefficients of 1 and r, separately for the left- and right-handed 
parts of the matrices, gives explicit expressions for 69 = ^ and u. If g^^n = exp 


2 ^L,R ■ T 


and dehning oJi = and Qa = — 66n), then: 


^ ^ 9(9 9 \ 

^ = e X 6jy + -Uii\i- +coi-\ [66 A- 9{9-66 a)] + ^9 ■ 6j a), 

= uJa -\- 9 X Uy + 0(0^); 

9 

u = 66y + {9 X 66a) tan- 


— Uly -|- ■ 


9 X 66, 


+ 0{9^). 


(2.2.7) 


( 2 . 2 . 8 ) 


For future reference we notice that the transformation law for 9 implies that the 
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three broken generators of G = SUl{2) x SUii{2) form an irreducible, three-dimensional 
representation of the unbroken subgroup, H = SUj{2). 

Similarly evaluating the combination 

U'd,U=\e,-iJ ( 2 . 2 . 9 ) 

gives the quantities with which the invariant lagrangian is built; 

= (^) V-(")(«■ 

= d,e (i - ^ J 0 ■ 9 ,e) e + o{e^)- (2.2.10) 

4 - -2 (^) 

= -]^exd^e + o{e^). (2.2.11) 

Notice that is odd, and A^j, is even, under the interchange 9 —6. The low-energy 

Goldstone boson lagrangian will be required to be invariant under such an inversion of 9, 
since this is a consequence of the parity invariance of the underlying QCD theory. 

It is useful to also record here the G-transformation rules for the other fields which 
can appear in the low-energy theory. Of particular interest are the nucleons — neutrons 
and protons — since low-energy pion-nucleon interactions are amenable to experimental 
study. The nucleon transformation rules under G = SUl{2) x S'f/jj(2) are completely 
dictated by their transformations under the unbroken isospin subgroup, H = SUi{2). 
Since the nucleons form an isodoublet, N = they transform under isospin according 

to 6N = ■ f N. The rule for the complete G tranformations is therefore simply 

SN = ^u-fN. ( 2 . 2 . 12 ) 
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We therefore see that the appropriate covariant derivative for nucleons is: 


D^N = d^N-'- A^{e) ■ f N. (2.2.13) 

2.3 Invariant Lagrangians 


We may now turn to the construction of the invariant lagrangian which governs the low- 
energy form for pion interactions. The lagrangian describing pion self-interactions involv¬ 
ing the fewest derivatives is uniquely determined up to an overall normalizing constant. 
As was discussed in detail in the previous chapter, this is because of the irreducibility 
of the transformation rules of the broken generators, X = | r 75 , under the unbroken 
isospin transformations. The most general G-invariant lagrangian density involving only 
two derivatives is 

Ct,-,, = —— gmn{d) + (higher-derivative terms), (2.3.1) 


where the metric, gmn, on G/H is: 


(o\ _ S ^ ^ ft ~ 

9mn\P) — ^ m ^ n — ^mn 1 /^o I ~r 


02 


04 


= (l-^02)+^0„^0n + O(0"). 


(2.3.2) 


For applications to pion scattering it is useful to canonically normalize the pion 
helds, that is, to ensure that their kinetic terms take the form: — | d^n ■ d^Ti. This 
requires the rescaling: 9 = t:/F. With this choice we have: 


£ = -■ 


£2 sin^ 


(f) 


TT^ 


(9^7f ■ -I- 


■7r2 - £2 sin2 


(f) 


TT^ 


(tt ■ (9^7f) (tt ■ 


= — d,X ■ 


-|-(higher-derivative terms) 
1 


(2.3.3) 


2 F 2 


(tt ■ d^Tr) (tt ■ O^tt) + 0{7r^) + 
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An integration by parts has been performed in writing the vr^ term of the expansion of 
the lagrangian. 

The couplings between nucleons and pions to lowest order in the derivative expansion 
involve only one derivative. The most general form for these that is consistent with the 
nonlinearly-realized G-invariance, and with parity invariance, is: 

= -N {^- ^-4{9) ■ f + ^ e^(0), 

= + ^ (iVy'^ysriV)-a^Tf (2.3.4) 

■ (vf X a^7f) + ■ ■ •. 

The ellipses here represent terms which involve either three or more powers of the pion 
held, more than two powers of the nucleon held, or involve more than one derivative. 

Clearly, only the one constant F need be determined in order to completely hx the 
dominant low-energy pion self-interactions, and a second constant, g, is also required to 
determine the lowest-order pion-nucleon couplings. 

2.3.1 Conserved Currents 


For future reference it is instructive to compute the Noether currents for the symmetry 
group G = SUi^{2) X SUji{2) in both the underlying theory (i.e. QCD), and in the ehective 
low-energy pion-nucleon theory. 

In QCD, the symmetry transformation under G is given by = | {uJl'Jl + ^rIr) ■ 
f q, where q = denotes the two-component quantity containing the two lightest 

quarks. The corresponding Noether currents that are obtained from the QCD lagrangian, 
eq. (|2.1.1|) are: 

and Tn = '^^qi^lRr q- (2.3.5) 


fl=- q'j^'j^f g. 
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The current, which corresponds to the unbroken SUj{2) isospin symmetry is therefore: 

= Jl + jfl = f q. The current for the broken symmetry is similarly: j% = 

t qi^lbT q. 


In the effective pion-nucleon theory the corresponding current may also be con¬ 
structed using the known action of G on vf and N, and using the lagrangian, whose 


lowest-derivative terms are given by eqs. ( p.3.3|) and (|2.3.4|) . Keeping only the terms in¬ 
volving a single pion or only two nucleons, at the lowest order in the derivative expansion, 
then gives: 


= — ^TT X ^ N'f^r N + ■ ■ ■, 

+ + (2.3.6) 


There are an inhnite number of higher-order terms in these currents corresponding to the 
inhnite number of interactions in the effective pion-nucleon lagrangian. All of the terms 
not written explicitly above involve additional factors of the helds vf or N, or involve more 
derivatives of these helds than do the terms displayed. 


2.3.2 Determining F and g 

These expressions for the Noether currents for G turn out to furnish a handle for ex¬ 
perimentally determining the constants F and g. This is because, as is made explicit in 
the following section, experimental information exists concerning the value of some of the 
matrix elements of the broken current fX- 

This experimental information exists because it is precisely the current which 
appears in that part of the weak-interaction lagrangian which describes transitions from 
d quarks to u quarks. Since these transitions are responsible for many reactions, including 
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all nuclear /3-decays, free-neutron decay, and decay, the corresponding matrix elements 
of this current can be measured. 

The terms in the underlying lagrangian which describe these decays are obtained by 
supplementing the QCD interactions of eq. (| 2 . 1 . 1 |) with the weak-interaction term: 

Gp cos 6c 




V2 


-h 75)d I'tlyO- + 75 )^ + h.c.. 


(2.3.7) 


Here the Dirac spinor held ^ and the Majorana held respectively represent a charged 
lepton — in practice, the electron and muon — and the corresponding neutrino. Gp is 
the Fermi coupling constant, which is determined from the muon decay rate to be Gp = 
1.16649(2) X 10“^ GeV“^. The angle 6c is called the Cabbibo angle, and it parameterizes 
the fact that the size of the coupling constant, Gp cos 6c, as seen in superallowed nuclear (3- 
decays is smaller than Gp as is measured in muon decay. Numerically, cos 6^0 = 0.9753(6). 


The main feature to be noticed from eq. ( p.3.7|) is that the quark combination which 
appears is a linear combination of the conserved SUp{2) x SUp{2) currents: 


m 7 ^(l -h 75 )d = + iT 2 ) q 


(2.3.8) 


In preparation for using eqs. ( p.3.6|) we have re-expressed the left-handed currents which 
appear in the weak interactions in favour of the axial and vector currents using: = 

To compute the decay rate for the reaction we require the following ma¬ 

trix element: {pG, i'y\Cy^eak\'^^)■ The part of this matrix element which involves strongly- 
interacting particles is where |D) is the QCD ground state. The isospin current, 

does not appear in 7 r+ decay because its matrix element vanishes due to the parity 
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invariance of the strong interactions. The most general form for this matrix element which 
is consistent with Poincare and isospin invariance is given by: 

WjA)^{x)= J ^ ^mn, (2.3.9) 

where it is conventional to extract the numerical factor 1 /(27r)^2, and the pion states 
are labelled here as members, |7r„) {n = 1,2,3), of an isotriplet. These are related to 
the physical states, having dehnite electric charge, by: |7r=*") = ^ (Itti) =f 'i\'^ 2 ))-i and 
k°) = ks). 

The only unknown quantity in this matrix element is the constant TV, which is in¬ 
ferred to be Ft, = 92 MeV by comparing the prediction, 1/rth = cos^ 6cF^rn?^mT,/An) (1— 

with the observed mean lifetime, Texp = 2.6030(24) x 10“® s, for the decay 

7r+ ^ 

Now, to lowest order in the derivative expansion, the matrix element of eq. ( p.3.9 ) 
can be directly evaluated as a function of the parameter F using the second of eq. ( |2.3.6| ). 
Comparing these results permits the inference 


F = F^ = 92 MeV. 


(2.3.10) 


With this constant in hand, we may now use the low-energy effective lagrangian to predict 
the low-energy pion self-interactions. 

Before proceeding to these predictions, we hrst repeat these steps for another matrix 
element in order to infer the value of the constant, g, which governs the size of the pion- 
nucleon coupling. We once again consider the weak interaction, eq. (p.3.7|) , but this time 
consider its prediction for the decay rate of a free neutron into a proton, an electron and an 
antineutrino: n —> peW- In this case, the most general Poincare-, parity-, time-reversal- 
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and isospin-invariant form for the desired matrix element is: 


{N{k,aM{x)\N{l,0) 


{N{k,amx)\N{lX)) 



(2.3.11) 


Here, and k^ are the fonr-momenta of the initial and hnal nncleons, and = {I — k)^ 


is their difference. ( and a similarly represent the spins of the initial and hnal nncleons. 
u{k, a) is the Dirac spinor for a free particle having momentnm k^, with k"^ -|- m\ = 0, 
and spin a. (Onr normalization is: u{k, a)u{k, a') = {rrij^/k^) Finally, 7 ^^ stands 

for the commntator ^ [ 7 ^, 7 *^]. 

The nnknowns in this matrix element are the fonr Lorentz-invariant fnnctions, 
Fi,F 2 ,Gi and G 2 , of the invariant momentnm transfer, q^. These fnnctions are not 
completely arbitrary, however, since they mnst encode the fact that we are working in 
a limit where G = SUr,{2) x SUnX) is taken to be a symmetry of the QCD lagrangian. 
The implications of G-invariance are easily extracted by demanding cnrrent conservation, 
dfj.X' = dfj,X = 0, for all of the cnrrents. Keeping in mind that the nncleons have eqnal 
mass in the G-invariant limit in which we are working, this implies no conditions on the 
fnnctions Fi and F 2 , bnt implies for the others: 


2im^ Gi(g^) = G2(g^). 


(2.3.12) 


In the rest frame of the decaying nentron, the components of the momentnm transfer, 
q^, are at most of order 1 MeV. Since this is mnch smaller than the typical strong- 
interaction scale, A ~ 1 GeV, which characterizes the matrix element, for nentron decay 


it snffices to simplify eqs. (|2.3.11|) nsing q^ ^ 0. In this approximation the nentron decay 


rate depends only on the two nnknown constants, Ki(0) and Gi(0). 
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0 


--0 


Figure 2.1: The Feynman graphs which give the dominant nucleon matrix elements of the 
Noether currents in the low-energy effective theory. Solid lines represent nucleons, and 
dashed lines represent pions. 


Since the constants F\(0) and Gi(0) correspond to the low-energy limit of these cur¬ 
rent matrix elements, they may be related to the constants which appear in the dominant 
terms of the low-energy effective lagrangian. This may be done by using eqs. (p.3.6|) to di¬ 
rectly evaluate the matrix elements of eqs. (|2.3.11|) . Doing so, we hnd contributions from 
the two Feynman graphs of Fig. 2.1. The hrst of these gives the direct matrix element of 
eqs. ( 12.3.61) , and contributes to the form factors Fi and Gi. The second graph uses the 


NNtt interaction of the effective lagrangian, eq. (p.3.4|) , together with the vacuum-pion 


matrix element of eq. (|2.3.9| ) . It contributes only to the form factor G 2 ■ Evaluating these 
graphs, we hnd: 


Fi = 1, Gi = g, and G 2 = 


2igmj, 

q2 


(2.3.13) 


from which we see Ei(0) = 1 and Gi(0) = g. The factor 1/q^ in G 2 comes from the 
massless pion propagator in the second of Figs. 2.1. Notice that this result for G 2 is 


precisely what is required to satisfy the current-conservation condition of eq. (|2.3.12|) . 

The hnding that Fi (0) = 1 states that this part of the matrix element is not renor¬ 
malized by the strong interactions, since this value for F\(0) is the same as would have 
been obtained if the matrix elements of were taken using the underlying quark states 
rather than with the composite nucleon states. Ei(0) is the same for both quarks and 
nucleons because Ei(0) is the quantity which determines the matrix elements in these 
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states of the conserved isospin charges, X = / d^r fj. But these have matrix elements 
which depend only on the SUj{2) transformation properties of the states whose matrix 
elements are taken. Since both quarks and nucleons are isodoublets, and since inspection 
of eq. (|2.3.5| ) shows that quarks have Xi(0) = 1, the same must be true for nucleons. 

The same argument does not hold for the axial current because this is a current for a 
symmetry which is spontaneously broken. This turns out to imply that the corresponding 
conserved charge is not well dehned when acting on particle states, and so Gi (0) need not 
be unity. 

We hnally arrive at the desired conclusion: the neutron decay rate, which is com¬ 
pletely determined by the constants Xi( 0 ) = 1 and 6 * 1 ( 0 ) = g, can be used to experi¬ 
mentally infer the numerical value taken by the remaining constant, g, of the effective 
lagrangian. The measured neutron mean life — which is Texp = 887(2) s — then implies 
g = 1.26. 

Having determined from experiment the values taken by F and g, we are now in a 
position to use the effective pion-nucleon lagrangian to predict the low-energy properties 
of pion-pion and pion-nucleon interactions. 

2.3.3 The Goldberger-Treiman Relation 

Historically the trilinear N — N — tt interaction has been written with no derivatives, as 
a Yukawa coupling: 

^NNTT — Wnnit {dd ysT N) ■ TT, (2.3.14) 

with the constant g^N-K found from phenomenological studies to be close to 14. But the 
value of this constant can be predicted in terms of the constant g, and this prediction 
serves as the hrst test of the low-energy pion-nucleon lagrangian. 
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The prediction starts with the trilinear N — N — interaction of eq. ( 2.3.4 ): 


^ N NTT 


w 

2K- 




(2.3.15) 


and performs an integration by parts to move the derivative to the nucleon helds. One 
then uses the lowest-order equations of motion for N: i.e. {^ + rrij^) iV = 0, to simplify 
the result. One obtains a result of the form of eq. ( p.3.14|) , but with 


QnNtt 


gm^ 
F. ■ 


(2.3.16) 


Using the experimental values: g = 1.26, rrij^ = 940 MeV and = 92 MeV gives the 
prediction gNNiz = 12.8, which agrees well with the phenomenologically inferred value. 
This prediction, eq. (|2.3.16| ), is known as the Goldberger-Treiman relation. 

We turn now to one last dangling issue which remains to be addressed before we 
can compute low-energy pion-pion and pion-nucleon scattering. 


2.4 Explicit Symmetry-Breaking 


Notice that the effective lagrangian, eqs. ( 2.3.3 ) and (|2.3.4|) , has very dehnite implications 
for the masses of the pions and nucleons. It states that the pion multiplet must be exactly 
massless, and that the nucleon masses must be equal. Since these predictions rely only on 
the assumption of unbroken G invariance, and since G-invariance only holds for QCD in 
the limit that and vanish, corrections to the pion and nucleon mass predictions can 
only be inferred by including the effects of the symmetry-breaking quark mass terms for 
the low energy effective theory. We do so, in this section, to lowest order in the light-quark 


masses. 


The quark mass terms in the QCD lagrangian are proportional to q q + h.c., 
where M = { ^ ^ j is the light-quark mass matrix. Under the G = SUl{2) x SUii{2) 
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symmetry, q — {q^Il + Q, this transforms into: 

q M'yi^ q-^q q + h.c.. (2.4.1) 

Althongh this is not invariant, it would have been invariant if the mass matrix had been 
a held which had also transformed nnder G according to: M g^Mg^. 

We imagine the effective pion-nucleon theory having an expansion in the light qnark 
masses, M: £efr = .Co + /li + - • •, where the snbscript indicates the power of M it contains. 
Each of these terms may be separately expanded in powers of the derivatives, and of the 
helds TT and N. The construction to this point has given the lowest-derivative terms which 
can appear in Cq. Our goal now is to determine the most general form which may be 
taken by £i, and which contains no derivatives of any fields. This will give the dominant 
symmetry-breaking contribution at low energies. 

2.4.1 Pions Only: Vacuum Alignment 

We start by focussing on the part of Ci which depends only on the pion helds. The 
form taken by Ci may be obtained from the following argument. We require that £i be 
G-invariant, but only if we take M —> g^Mgl in addition to transforming the helds tt in 
their usual way. 

It is straightforward to construct one such a term involving only the pion helds. The 
simplest construction is to use the quantity S = = cos 9 + i9 ■ r sin 9, dehned 

in section (8.2.1), which transforms according to S —S = g^'^gl^. (Recall here that 9 
and 9 are dehned by 6* = \l9 ■ 9 and 9 = 9/9.) 

A possible lagrangian therefore is: 

= -A Re Tr [M E] - B Im Tr [M S] (2.4.2) 


67 



(2.4.3) 


= -A (rriu + md) cos 6 - B (m„ - rud) O 3 


sin 6 


Clearly this generates a potential energy which is a function of 6 , as is possible because 
of the explicit breaking of the SUi^{ 2 ) x SUii{ 2 ) symmetry by the quark masses. As a 
result, all values for 6 are not equally good descriptions of the vacuum, and it is necessary 
to minimize the potential in order to determine the vacuum value for 9. This choosing 
of the vacuum value for the pseudo-Goldstone fields after the introduction of explicit 
symmetry-breaking is a process known as vacuum alignment. 

In the present instance the potential is minimized by 61 = 62 = 0, and has the 
schematic form V{9) = —Ci{9) = A cos 9 + B sin 9 = —\A \ cos{9 — 9 q), for 9 = 9^ and A 
and B (or, equivalently, A and 9o) constants. This, once minimized (giving 6'min = 9o) 
and expanded about the minimum (with 9 = 6* min + 9'), the potential becomes V{9') = 
— |A| cos 9', leaving our lagrangian density of the form: 


C 


l.TTTT 


Tr[M (S + St)], 


= {Tnu + md) Ai^ cos 6*, 
,2 


(2.4.4) 


= m^ 


i 7f ■ vf - (vf ■ 7f)2 + 0(7r®) 


with the constant A4^ positive. 

Eq. (12.4.4 ) gives the required symmetry-breaking interaction, where the tt’s are 
chosen so that the vacuum is at vf = 0, and, in the last line, we have also eliminated 
the arbitrary parameter, Ai, which has the dimensions of mass, in terms of the common 
mass, tUtt, we find for all three pions: 


ml = (m„ md) 


F3 


(2.4.5) 


There are several features here worth highlighting. Firstly, notice that all of the pion 











self-interactions necessarily preserve isospin to this order in the derivative and quark-mass 
expansions. This implies, among other things, degenerate masses for all three pions. This 
preservation of isospin does not rely on the isospin-breaking difference, rUu — being 
small in comparison with or nid- Rather, it relies only on and rud both being small 
compared to the characteristic scale of QCD. We must look elsewhere for an understanding 
of the observed mass difference between the charged and neutral pions, such as to the 
isospin-breaking electromagnetic interactions. 

Secondly, the lagrangian of eq. (|2.4.4|) necessarily implies a quark-mass-dependent 
contribution to the vacuum-energy density, —pv = = 0) = rn^F^ = (m„ -|- 

md)M.^- This contribution permits a physical interpretation for the parameter M., as 
follows. In the underlying theory the derivative of the total vacuum energy, py, with 
respect to any quark mass is given by: 


dpv 

dm. 




(2.4.6) 


where |fl) is the QCD ground state. We see, by comparison with the pion scalar potential, 
eq. (12.4.41) , that 

= (D|dd|D) = -M^ + ■ ■ ■, (2.4.7) 


where the ellipses here denote the contributions due to quantum effects in the low- 
energy pion-nucleon theory. Evidently M. ~ gives the size of 

the expectation value which is responsible for the spontaneous breaking of the chiral 
SUl{2) X SUr{2) symmetry. Using the values ruvr = 140 MeV, Tn- = 92 MeV and 
4.5 MeV < < 14 MeV then gives 230 MeV < M. < 330 MeV for this scale. 

Next, we remark that since none of the terms which appear in depend on 

derivatives of vf, they do not at all affect expressions (|2.3.6|) for the conserved Noether 
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currents of the theory. We therefore need not at all change the above analysis which 
determined the experimental values for the constants F and g from pion and nucleon 
weak decays. 

Finally, we note that the mass term given in eq. (|2.4.4| ) is the only possible term (up 
to normalization) which is linear in M and depends only on 6*, and not on its derivatives. 
This uniqueness follows from the impossibility of building a G-invariant scalar potential. 
To see this, we write Ci = Tr (M O) + h.c., for 0{9) a two-by-two matrix function of the 
Goldstone boson helds. O must transform under G according to O ^ g^Ogl^. Oi = E 
satishes these conditions, but suppose 0 = 02 were a second, independent solution. In 
this case, the combination V{6) = Tr [OiO^J or V(9) = det [OiO^] would be a G-invariant 
scalar potential, as would any of the eigenvalues of the matrix O^O^- Since we know from 
the previous chapter no such potential is possible, it follows that an independent quantity, 
O 2 , also cannot exist. 

2.4.2 Including Nucleons 


We next consider the part of £1 which involves precisely two factors of the nucleon held, 
and no derivatives. That is: Ci^n-k = —N f{9,M) -|- h.c., where the transformation 

laws: 9^9 and iV —>■ iV = h{9,g)N imply that the matrix-valued function, f{9,M), 
must satisfy: f{9,giiMg\) = h f{9,M) hb 

The solution, unique up to normalization, to this condition is: / = where 


Mi = ujj = exp 


9 ■ T 


. We therefore hnd: 


£l,Af7r — —A N 




N + h.c. 


= -XNMN-i\ 


'^sin 6*^ 


N{9-T,M]-f5N 
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N[e- fMe- f+9‘^M 


N 


(2.4.8) 


= -XN M N-t^m] N 


2F^ 


A 


4F2 


N ■ rMn ■ t + 7 i ■ ti M N + ■ ■ ■. 


As usual, curly braces denote the anticommutator of the corresponding matrices: {A, B} = 
AB + BA. 

Notice that besides providing nonderivative pion-nucleon couplings, this term also 
splits the neutron and proton masses by an amount: 


6xmt, = X{md - m„). 


(2.4.9) 


Even though they do not contribute to the pion mass splittings, the differing u and d 
quark masses do act to split the masses of the nucleon isodoublet. Now, and rrid 
may be determined by repeating the above analysis for the masses of the lightest eight 
mesons, tt, K, rj, under the assumption that these are all pseudo-Goldstone bosons for the 
symmetry group SUi^{3) x SUji{3) which is appropriate when the s quark is assumed to be 
light in addition to the u and d quarks. In principle, once this has been done, eq. ( ^.4.9| ) 
permits the constant A to be extracted from the experimental difference, rUn — rup = 
1.293318(9) MeV. It is important in so doing to include also the contributions of the 
electromagnetic interactions to this mass difference, since these are similar in size to 
eq. (HD.! 


2.5 Soft Pion Theorems 


We may now proceed to work out some of the implications of the effective lagrangian 

for low-energy pion-pion scattering. As usual, the hrst question must be to ask which 
A thank John Donoghue for reminding me of the importance of this electromagnetic contribution. 
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interactions need be considered in which Feynman graphs in order to properly mimic the 
low-energy expansion of the underlying QCD theory. To this end we use the powercounting 
results of Chapter 1. 

2.5.1 Power Counting 

For simplicity we consider here only the case where there are no nucleons in the low-energy 
theory, since only in this case we can directly use the power-counting results obtained in 
Chapter 1. As has already been emphasized, these results cannot be directly applied 
to nucleons, because they were derived under the assumption of very light, relativistic 
fermions, and the nucleons in the low-energy pion-nucleon theory are very massive and 
nonrelativistic at the energies of interest. 

Power Counting in the Symmetric Limit 


We start by omitting all symmetry-breaking terms of the pion lagrangian which are pro¬ 
portional to the quark masses. These are considered in the next section. In this case the 


pions are massless, and their Goldstone-boson lagrangian has the form given in eq. (|2.3.1| ): 
■f 


= -F2 j 9„„(9) 8„9”c)'‘r + d^ff’cTe’’ + ■■■ 


(2.5.1) 


where c is a dimensionless number which is, in principle, calculable from QCD. The ellipses 
represent an inhnite sequence of additional terms, including several others which also have 
four derivatives, as does the displayed term proportional to c. 

Eq. ( |2.5.1|) has the form of eq. (|1.1U.1|) , with: / = v = and M = ^ 

AtiFt, ~ 1 GeV. The powercounting estimate of eq. (|1.10.9|) for the scattering of pions 
becomes: 


AM - FM 


IT X 






V 


AttF^ 


A. 

,Av 


(2.5.2) 
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which is a famous result, due hrst to Weinberg. 

For a given observable, ^^(g), the number, E, of external particles is hxed. In 
this case it is only the last two factors of eq. ( p.5.2|) which differentiate different types of 
contributions. We remark that in practical applications for pion scattering, it happens 
that ~ AttFt, ~ 1 GeV. As a result, the second-last factor, {A^/, turns out 
to be 0(1) for realistic pion scattering. This means that it is only the last factor which 
controls the importance of various interactions. 

According to eq. ( 2.5.2 ), the contribution of higher-derivative interactions is clearly 
only suppressed by the ratio q/A^, which limits us to considering only low-energy pion 
dynamics near threshhold. The dominant term in the expansion in powers of q/A^ corre¬ 
sponds to choosing the smallest possible value for the quantity P = 2 + 2L + J2ik{^ — ‘^)yik- 
It is noteworthy, when using this expression, to remark that all of the interactions have at 
least two derivatives (we temporarily ignore pion masses etc.), and so k > 2. Furthermore, 
it is only the hrst term in the derivative expansion which has k = 2, and so the k = 2 
interaction is unique. 

As a result the lowest value possible for P is P = 2, and this is only possible if 
L = 0 and if 1/)^ = 0 for all fc > 2. This implies that the dominant contribution to pion 
scattering is computed by using only the hrst term in the ehective lagrangian, eq. ( 2.5.1| ), 
and working only to tree level with these interactions. 

The next-to-leading terms in g/A^ then have P = 4, which can arise in either one of 
two ways, [i) We can have L = 1 and Vik = 0 for all k > 2] or {ii) we can have L = 0 and 
Vii = 1 for some i, while 1^2 can take any value. This states that the subleading, O(g^), 
contribution is obtained by either working to one loop order using only the interactions 
of the hrst term of the lagrangian of eq. (|2.5.1|) , or using tree graphs having exactly one 
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vertex taken from the four-derivative interactions in the lagrangian, as well as any number 
of interactions from the hrst term in the lagrangian. 

In this way it is clear how to compute any given order in the expansion in powers 
of q/k^. 

Symmetry-Breaking Terms 


Before proceeding to calculations, we must also include one other feature. We must track 
the appearance of the explicit symmetry-breaking terms, of which we only keep those 
which are proportional to a single power of the light-quark masses, rriq ~ mu^md- These 
vertices, which come from the symmetry-breaking term £i, can be very simply included 
into the power-counting results of Chapter 1 by considering all of the non-derivative 
interactions to be suppressed by a dimensionless coupling, Ck=o ~ (mg/A^) ~ (m^ My- 
With these points in mind, eq. (|2.5.2|) becomes: 






(2.5.3) 


where P can be written in either of two equivalent ways: 


P = 4-E + J2{k + t-4)V,k, 

ik 

= 2 + 2L + ^ (fc - 2) Vik. (2.5.4) 

ik 

Using the hrst of these forms for P we see that the contribution to the powercounting 
estimate due to the insertion of the symmetry-breaking terms with fc = 0 is: 



The dangerous interactions are clearly those for which i < 4. For example, the pion mass 
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Figure 2.2: The Feynman graphs which give the dominant contributions to pion-pion 
scattering in the low-energy pion-nucleon theory. The hrst graph uses a vertex involving 
two derivatives. The second involves the pion mass, but no derivatives. 


term, ~ has i = 2, and so contributes the factor 


n(^' 

k=0 \ y 


Vik 


(2.5.6) 


Now, we are interested in applications for which the external momenta, q, are of 
order several hundred MeV, and so g ~ 771^,. ~ For these momenta the factor 

{rriqAJq^) ~ (ml/q^) rsj 0(1). It follows that it is not a good approximation to perturb 
in the pion mass term, and so we should include this term in the unperturbed lagrangian. 
That is, we should include the pion mass explicitly into the pion propagator so that 
Gn{q) = — *e)- It is legitimate to perturb in all of the other symmetry¬ 

breaking interactions of the scalar potential, however, since for these /c = 0 and z > 4. 

We now turn to specihc interactions, starting with pion-pion scattering, for which 
E = A. In this case the above powercounting shows that there are precisely two dominant 
contributions. The hrst of these consists of the tree graph of Fig. 2.2, using the four- 


point vertex from the G-invariant term which involves two derivatives, eq. (|2.3.3|) . The 
second contribution is also obtained using the graph of Fig. 2.2, but this time takes the 
four-point pion self-interaction from the symmetry-breaking scalar potential of eq. ( |2.4.4| ). 
Although the hrst term is unsuppressed by the light-quark masses, it gives a contribution 
which is down relative to the second term by two powers of external momenta, q. Both 
are therefore comparable in size for pions near threshhold, q^ ~ m^. All other graphs are 
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smaller than these two by powers of either rriq or q. 

2.5.2 Pion-Pion Scattering 


We now compnte pion-pion scattering by evalnating the graphs of Fig. 2.2 nsing the 
effective pion self-conplings of eqs. (p.3.3|) and p.4.4| . A straightforward calcnlation gives 
the following S matrix element for the scattering 




i5^{Pa+Pb-Pc-Pd) 

W^x/PaPbPcPd 




ah.cd') 


(2.5.7) 


with 


•^ab,cd 


F2 L 


^ab^cd ("S ^ tt ) 4” ^ac^bd (t ^n) 4~ ^ad^bc ^ tt ) 


(2.5.8) 


where the Lorentz-invariant Mandelstam variables, s = —{pa +Pb)‘^, t = —{pa — Pc)"^ and 
u = —(pa— Pd)"^ are related by the identity: s + t + u = 4m^. In the CM frame s, t 
and u have simple expressions in terms of the pion energy, E, and three-momentnm, q: 
s = AE'^, t = —2E‘^ + 2q^ cos'd and u = —2E‘^ — 2q^ cos'd. Here'd denotes the scattering 
angle, also in the CM frame. 

Comparison with the data is made nsing channels having dehnite angnlar momentnm 
and isospin. If we decompose Aab,cd into combinations, , having dehnite initial isospin: 


Aab,cd — -SabScd + “ (dac^M — daddfec) + 


^{SacSbd + SadSbc) ^SabScd^ 


, (2.5.9) 


then 


,(o) _ 2s-< aW 4(2)= s-2m! 


F2 ’ F2 ’ "cd T^2 

TT '^77 T" 

The next step is to resolve these amplitndes into partial waves: 


(2.5.10) 


A[^'^ ^ 5 ^ J ^cos'd (cos'd) 


(2.5.11) 
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Parameter Leading Order 

Next Order 

Experiment 

Uq 

7ml/?>2'KFl 0.16 

0.20 

0.26(5) 

% 

ml/A-KFl 0.18 

0.26 

0.25(3) 

a\ 

m2/247rF2 0.030 

0.036 

0.038(2) 

al 

— ml/lQTTFl -0.044 

- 0.041 

- 0.028(12) 

hi 

— ml/SirF^ -0.089 

-0.070 

-0.082(8) 

Table 2.2: 

Theory vs Experiment for Low-Energy Pion Scattering 


where Pi{cos-^), as usual, denote the Legendre polynomials (so Po{x) = 1 and Pi{x) = 
x). Since all of the dependence on 'd appears through the variables t and u, and since 
eqs. (p.5.10|) give and as functions of s only, it is clear that only the partial wave 
£ = 0 is predicted at lowest order for the even isospin conhgurations. Also, since is 
strictly linear in cos'd, it only involves the partial wave ^ = 1. 

The actual comparison with the data is made by expanding the (real part of) 
in powers of the squared pion momentum: /ml. = E’^/rnl — 1 = (s — 4m^)/4m^. That 
is, writing 

..s f AV f A \ 

(2.5.12) 


A' = 


+ bi —+ ■ ■ ■ 


m 


dehnes the pion scattering lengths, a^, and slopes, h\. Applying these dehnitions to 
eqs. ( |2.5.10| ) gives the predictions of the second and third columns of Table 2.2. Column 
three gives the numerical value corresponding to the analytic expression which is given in 
column two. The predictions including the next-order terms in the q'^/A^ expansion have 
also been worked out, and are given in the fourth column of this Table. [| 

Comparison of these predictions with experiment is not straightforward, since it is 
not feasible to directly perform pion-pion scattering experiments. Instead, the pion-pion 


have taken these values from the excellent book Dynamics of the Standard Model by Donoghue, 
Golowich and Holstein (see bibliography). 
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scattering amplitudes at low energies are inferred from their influence on the dual state 
in other processes, such as K ^ nneue oi’ nnN. The experimental results, as 

obtained from kaon decays, for those quantities which are predicted to be nonzero at 
lowest order are listed in the right-hand-most column of Table 2.2. Data also exist for 
other partial waves which are predicted to vanish at lowest order, such as / = 0,i = 2, 
and these are found to be in good agreement with the nonzero predictions which arise at 
next-to-leading order in the low-energy expansion. 

This example nicely illustrates the predictive power which is possible with a low- 
energy effective lagrangian, even if it is impossible to predict the values for the couplings 
of this lagrangian in terms of an underlying theory. This predictive power arises because 
many observables — e.g. the pion scattering lengths and slopes — are all parameterized 
in terms of a single constant — the decay constant, TV — which can be extracted directly 
from experiment. We emphasize that this predictive power holds regardless of the renor- 
malizability of the effective theory. Computing to higher orders involves the introduction 
of more parameters, but predictions remain possible provided that more observables are 
computed than there are parameters to fix from experiment. The information under¬ 
lying these predictions comes from the symmetries of the underlying theory, as well as 
the restrictions due to the comparatively small number of possible interactions which can 
appear at low orders of the low-energy expansion. 
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Chapter 3 

Magnons: Nonrelativistic 
Applications 


We now turn to a second application, this time to a nonrelativistic system. Besides once 
again illustrating the utility of the effective-lagrangian techniques, this example shows 
how the analysis can be applied to more complicated condensed-matter systems. It also 
illustrates how effective lagrangians permit the separation of the generic predictions which 
follow only from general properties such as the symmetry-breaking patterns, from the 
details of the models which may be used to establish these symmetry-breaking patterns 
from the underlying physics. 

We take as our application the macroscopic behaviour of ferromagnets and antifer- 
romagnets. These systems exhibit a transition at low temperatures to a phase which is 
characterized by a bulk order parameter, which we call S for the ferromagnet and N for 
the antiferromagnet, which transforms under rotations as a vector. For ferromagnets this 
order parameter can be taken to be the overall magnetization of the sample. Because this 
order parameter spontaneously breaks the rotational symmetry, Goldstone bosons must 
exist and so must appear in any low-energy (or long-wavelength) description of these sys¬ 
tems. It is the low-energy interactions of these Goldstone bosons which is described in 
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this chapter. 

The distinction between a ferromagnet and an antiferromagnet requires more infor¬ 
mation concerning the underlying material. As for most condensed-matter systems, the 
underlying microscopic system consists of an enormous number of electromagnetically in¬ 
teracting electrons and atomic nuclei. One picture of what is going on in a ferromagnet 
or antiferromagnet consists of imagining the electrons being reasonably localized to their 
corresponding atoms, with these atoms carrying a net magnetic moment due to its having 
a net electronic spin. The electron in each atom which carries the net spin interacts with 
its counterparts on neighbouring atoms, resulting in (among other things) an effective 
spin-spin interaction between these atoms. This spin-spin interaction can come about 
due to the exchange part of the Coulomb interaction, which arises due to the antisym- 
metrization of the wavefunction which is required because of the statistics of the electrons. 
Under varying circumstances one might suppose this spin-spin interaction to either favour 
the mutual alignment of neighbouring spins, or their anfzalignment (where the spins line 
up to point in opposite directions). 

The behaviour of such mutually interacting electronic or atomic spins may then be 
investigated by abstracting out just this spin dynamics into a simplifying model. For ex¬ 
ample, the interacting electrons can be replaced by a system of spins which are localized 
to each of the lattice sites which define the nuclear positions in the solid. The mutual in¬ 
teractions of the atoms can be reduced to a spin-spin coupling having a phenomenological 
sign and magnitude, according to whether it is energetically favourable for neighbouring 
spins to be aligned or antialigned. Such models show that at low temperatures macro¬ 
scopic numbers of these spins tend to either align or antialign, according to which of these 
takes less energy. A system for which neighbouring spins tend to align, prefers to acquire 
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Figure 3.1: The Distinction Between Ferromagnets (F) and Antiferromagnets (AF). 

a net magnetization since the magnetic moment of each atom adds to give a macroscopi- 
cally large total. This is a ferromagnet. If neighbouring spins prefer to antialign, then the 
order of the ground state consists of spins which alternate in their alignment, with every 
other spin pointing in a hxed direction, and the others pointing in the opposite direction. 
Such an arrangement is called antiferromagnetic. These two alternative arrangements are 
pictured in Fig. 3.1. 

The statistical mechanics of such spin models can successfully describe many fea¬ 
tures of real ferromagnets and antiferromagnets. So long as calculations are based on 
models, however, it is difficult to quantitatively assess their accuracy. One of the pur¬ 
poses of the present chapter is to show that some predictions for these systems are very 
robust, since they do not rely on more than the qualitative features of the models. The 
robust predictions are those which can be formulated completely within the framework 
of a low-energy effective theory, and which therefore rely only on the spectrum and sym¬ 
metries which dominate at low energies. The accuracy of this kind of prediction can be 
quantitatively assessed since this accuracy is controlled by the domain of validity of the 
effective theory itself. The role played in this kind of calculations by the details of an 
underlying model is simply the prediction of the quantum numbers and symmetries of the 
low-energy degrees of freedom, and so the model need only get these qualitative features 
right in order to accurately reproduce the proper low-energy behaviour. 
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This line of reasoning, in which some quantitative predictions can be justified as 
general low-energy features of a given system can be of great practical importance. For 
example, some very high-precision measurements are now based on the macroscopic be¬ 
haviour of complicated condensed matter systems. Examples are the Josephson effect, 
or the Integer Quantum Hall effect, both of which have been used to £x the best mea¬ 
sured value for the electromagnetic fine structure constant, a. These determinations are 
accurate to within very small fractions of a percent. We should only believe such a deter¬ 
mination of a if we can equally accurately justify the theoretical predictions of the effects 
on which the determination is based — a very tall order if the prediction is to be based 
on a model of the underlying system. Happily, such an accuracy is possible, and is one of 
the fruits of an effective lagrangian analysis of these systems. 

We now turn to the application of these effective-lagrangian techniques to the de¬ 
scription of the long-distance, low-energy behaviour of the ordered spin systems. 

3.1 Antiferromagnetism: T Invariance 

We start with applications to the low-energy properties of antiferromagnets. We do so 
because antiferromagnets preserve a type of time-reversal symmetry, which makes the 
analysis of its low energy behaviour fairly similar to what would apply for relativistic 
systems. 

For an antiferromagnet, the order parameter, N, can be taken to be the staggered 
sum of the spins, Sj, for each lattice site, ‘i’: 

N = y(-)'Si, (3.1.1) 

i 

where the sign, (—)*, is positive for one sublattice for which all spins are parallel in the 
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ordered state, and is negative for the other sublattice for which all spins are antiparallel 
with those of the first lattice. We shall refer to these sublattices in what follows as the 
‘even’ and ‘odd’ sublattice respectively. By taking such an alternating sum we hnd the 
expectation (N) 0 in the system’s ground state, Iff). 

The action of time-reversal invariance, T, is to reverse the sign of the spin of every 
site: Sj —> —s,. Although this transformation also reverses the order parameter, N ^ 
—N, it may be combined with another broken symmetry, S, to obtain a transformation, 
T = TS, which is a symmetry of N. This other symmetry, S, consists of a translation 
(or shift) of the whole lattice by a single lattice site, taking the entire ‘even’ sublattice 
onto the ‘odd’ sublattice, and vice versa. Since both S and T act to reverse the direction 
of N, they preserve N when they are performed together. 

We next turn to the construction of the general low-energy lagrangian for the Gold- 
stone bosons for the breaking of rotation invariance — called magnons — for these sys¬ 
tems. 

3.1.1 The Nonlinear Realization 

The hrst step is to identify the symmetry breaking pattern, G ^ H. At hrst it is tempting 
to assume that the role of G should be played by the spacetime symmetries, since these 
include rotations. This is not correct, however, for several reasons. Firstly, the spacetime 
symmetries of a lattice do not consist of the full group of translations and rotations since 
these symmetries are broken by the lattice itself. The unbroken subgroup consists only of 
the group of lattice symmetries: i. e. those translations and rotations which take the lattice 
to itself. There are indeed Goldstone bosons for the spontaneous breaking of translational 
and rotational symmetry down to this lattice group, but these are the phonons and are 
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not the focus of the present analysis. 

In fact, the rotations of the spins on the lattice can be taken to be an internal SU{2), 
or SO{3), symmetry, rather than a spacetime symmetry. This is because the action of 
rotations on the intrinsic spin of a particle becomes an independent internal symmetry, 
separate from spacetime rotations, in the limit that the particle involved is nonrelativistic. 
This is because all of the interactions which couple the orbital angular momentum with 
the spin angular momentum vanish in the limit that the particle mass tends to inhnity. 
Since the spins of interest for real systems are those for nonrelativistic electrons or atoms, 
we may consider the broken symmetry group to be an internal symmetry, G = SU (2) 
(which equals G = SO{3), locally). In real life, the electron mass is not inhnite, so there 
are small ‘spin-orbit’ effects which really do break the internal spin symmetry. These 
introduce small corrections to predictions based on this symmetry, such as the exact 
gaplessness of the Goldstone mode. We ignore any such symmetry-breaking effects in 
what follows. 

The order parameter for the symmetry breaking is the vector, N, itself, and so the 
group of unbroken transformations is H = U{1) (or, SO{2)), consisting of rotations about 
the axis dehned by (N). The coset space which is parameterized by the Goldstone bosons 
is therefore the space G/H = SU{2)/U{1)^ or S'0(3)/S'0(2). This last way of writing 
G/H identihes it as a two-sphere, 821 since this describes the space swept out by the 
action of rotations on a vector, (N), of hxed length. 

We now have two ways to proceed. We could, on the one hand, follow the steps 
outlined in Ghapter 1 to construct the nonlinear realization oi G/H and its invariant 
lagrangian. Instead we choose here to take a simpler route. As discussed in Ghapter 1, 
the most general possible low-energy Goldstone boson lagrangian must necessarily take 
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the form of eq. (|1.9.1|) : 


I^AF — 




+ (higher-derivative terms), 


(3.1.2) 


where gap{(^) is an S'0(3)-invariant metric on the two-sphere. This form is the most gen¬ 
eral consistent with the nonlinearly-realized SO {3) invariance, as well as with invariance 
with respect to translations, rotations and the time-reversal-like symmetry, T, described 
above.Q The T invariance rnles ont interactions having an odd nnmber of time derivatives, 
snch as the term linear in time derivatives which was constructed in Chapter 1. 


The main point to be made is that the lagrangian given in eq. (|3.1.2|) is unique, a 
result which follows from the uniqueness of the S'O(3)-invariant metric on the two-sphere. 
The uniqueness of this metric is a consequence of the fact that the two broken generators 
of S'0(3)/S'0(2) form an irreducible representation of the unbroken subgroup 50(2) — 
a condition which was shown to imply a unique metric in Chapter 1. Since it is unique, 
any representation of it is equally good and we choose here to use the familiar polar 
coordinates, for the two-sphere, in terms of which the invariant metric has the 

usual expression: = dO'^ + sin^ OdcjP'. With this choice the above lagrangian becomes: 


p2 Tp2 

+ sin^ d 02^ - ^ {ye ■ VO + sin^ d V0 • V 


-|-(higher-derivative terms). 


(3.1.3) 


Alternatively, we can equally well parameterize S 2 using a unit vector, n, where 

^We use translation and rotational invariance for simplicity, even though these are too restrictive for 
real solids, for which only the lattice symmetries should be imposed. For some lattices, such as cubic 
ones, the implications of the lattice group turn out to be the same as what is obtained using rotation and 
translation invariance, at least for those interactions involving the fewest derivatives which are studied 
here. 
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Ux = sin 9 cos 0, Uy = sin 9 sin 0 and = cos 6*, so n ■ n = 1. Then 


F‘^ ■ ■ 

Cap = -^n-n -^Vn-Vn 


+ (higher-derivative terms). 


(3.1.4) 


This variable, n{r, t), makes most clear the physical interpretation of the Goldstone modes: 
they describe long-wavelength variations in the direction of the order parameter (N). 
It has the drawback of hiding the self-interactions which are implied by C^p, since the 
lagrangian of eq. (|3.1.4|) is purely quadratic in n. The self-interactions, which are manifest 
in expression (|3.1.3| ), are nonetheless present, and are hidden in the constraint n ■ n = 1. 

The nonlinear realization of SO{3) transformations on these variables is straight¬ 
forward to work out, starting with the transformation rule for s: 6n = u x n, where 
the vector Co represents the three 50(3) transformation parameters. This implies the 
transformat ions: 


69 = 00yCos(j) — 00xSm(j) 

6(j) = ujz — uJxCot9 cos(j) — ujyCot9sm((). (3.1.5) 

With these transformation laws we may immediately write down the first terms in 
a derivative expansion of the Noether currents, = (p,j), for the 50(3) invariance in 
the low-energy effective theory. They may be most compactly written: 

p = {n X n) + ■ ■ ■ and j (Vn x n) -|- • • •, (3.1.6) 

where the dots are a reminder of the unwritten higher-derivative contributions. 


The other quantities which arise in the general nonlinear realization may also be 
constructed in terms of these variables. For example, the four independent components 





of the covariant quantity (zweibein), are most easily constructed, following the 

geometrical picture of Chapter 1, as the components of any two orthogonal vectors which 
are tangent to the two sphere. These may be found by differentiating the unit vector, f?, 
because the identity n ■ n = 1 implies n ■ 6n = 0, for any variation, 6 n. Denoting these 
two vectors by ee = On/06 and = dn/d(j), we have in cartesian components: 


(e6i)a; = cos6'cos0, (e6))y = cos6'sin(;/), {eg)z = — sinO] 

=-sin Osin (j), (e^)^ = sin^cos^, (e^)^ = 0. (3.1.7) 


Clearly these vectors satisfy ■ e)(, = ■ n = e((, • n = 0, and = 1, ■ e)(, = sin^ 9, so 

Oa ■ Op = Qap, as required. The two-by-two matrix, e“^, of components may be found by 
expressing the two vectors, ep, dehned by eqs. (p.l.7|) , as linear combinations of any two 
orthonormal basis vectors, t", which lie tangent to the sphere: i.e. e'^p = t°‘ ■ ep. Using 
the basis vectors Op themselves for this purpose leads to the result: 


e e e <p 
e^e 


1 0 
0 sin 9 


(3.1.8) 


3.1.2 Physical Applications 


Any physical question that could be asked of the low-energy limit of the underlying theory 
can equally well be addressed using the low-energy effective lagrangian. In particular, the 
lagrangian just derived for the Goldstone modes for a ferromagnet may be used to describe 
the response (at zero or nonzero, but small, temperature) of the system to probes which 
couple to the spin degrees of freedom. 

In order to interpret the constants Fg and T) it is convenient to expand the held 
n, or equivalently 9 and 0, about its vacuum conhguration, uq = (N). We are free to 
perform an SO{3) rotation to choose the direction of no arbitrarily, and so we use this 
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freedom to ensure that hq points up the positive a:-axis. This implies that 6 and 0 take 
the vacuum values, Qq = \ and 0o = 0. Writing the canonically normalized fluctuation 
helds by 0 = I + "d/Fi and (j) = ^p/Ft, the lagrangian becomes: 

^ V-d ■ V-d) + ^ cos^ Vp ■ Vpj 

+ (higher-derivative terms), 


C,. = 


(3.1.9) 


^ (■d^ - V'd ■ Vp ■ Vp) 

(^p^ — Vp ■ V(p) -I- 


^2 


The constant, v, here represents the ratio v = Fg/Ff. The ellipses denote terms that 
involve at least six powers of the helds, or which involve more than two derivatives with 
respect to either position or time. 

The terms quadratic in the helds describe two real modes which propagate according 
to the linear dispersion law: E{p) = vp. These modes physically correspond to spin 
waves', small, long-wavelength precessions of the vector n about its vacuum value, uq. 
They carry ±1 unit of the conserved 50(2) spin in the direction parallel to (N). This 
gives the physical interpretation of the parameter v to be the velocity of propagation of 
these modes. The condition that this velocity must be smaller than the velocity of light is 
n < c = 1 (in fundamental units), or, equivalently, Fg < Ft. Ft is similarly seen to govern 


the strength of the interaction terms in eq. (p.l.9|) . 

These modes and their interactions are amenable to experimental study through 
their electromagnetic couplings. Although magnons carry no electric charge, they do 
couple to magnetic helds, B, due to the interactions of the microscopic magnetic moments 
which participate in the long-wavelength spin waves. This gives a coupling of the magnetic 
held to the medium’s spin density. This coupling can be probed, for example, by scattering 




neutrons which are also electrically neutral but which carry an intrinsic magnetic moment 
which interacts with magnetic helds. 


The interaction between magnons and electromagnetic helds is therefore given by 
a term of the form: £em = s ■ B, where T is the system’s spin density. The lowest- 
dimension effective interaction between magnons and electromagnetic helds is now ob¬ 
tained by expressing the spin density in a derivative expansion, using Noether’s result, 
eq. ( |3.1.6|) , for s = p. The result is: 


■^em Z/ S ■ B, 

= —vFI Bx {6 sin (j) + (j) sin 9 cos 9 cos 0) 

+By (—9 cos (j) + <j) sin 9 cos 9 sin (f)) — B^ (f) sin^ 9) 
= uFf (^By "9 -f- Bz (pj 


(3.1.10) 


where u is an ehective coupling parameter having the dimensions of magnetic moment (or: 
inverse mass, in fundamental units). Notice that the time derivative in this interaction 
ensures invariance with respect to T transformations, under which s —>■ —s and B ——B. 
A nonrelativistic neutron couples to the magnetic held with strength 


C = —/i^ n^an ■ B, 


(3.1.11) 


where n{x) denotes the two-component neutron held, and a denotes the Pauli matrices 
acting in the two-component neutron spin space. The constant is the neutron magnetic 
moment which is, in order of magnitude, ~ e/m^v. As usual e is the electromagnetic 
coupling constant {i.e. the proton charge) and is the neutron (or nucleon) mass. 


Using these interactions, eqs. ( p.l.l0|) and ( p.l.ll|) , the cross section per-unit-volume 
for neutron scattering from the medium can be computed. For slowly-moving neutrons. 











and under the assumption that only the momentum, p', of the scattered neutron is mea¬ 
sured we hnd: 


da _ p') 


(3.1.12) 


Vd'^p' 

Here Ujv "C 1 is the speed of the incoming, nonrelativistic, neutron, and V is the volume 
of the medium whose magnons are responsible for scattering the neutrons. E and E' 
are the energies of the initial and scattered neutrons, and p and p' are their momenta. 
The function, Vij{q), is the magnetic-moment interaction potential (in momentum space) 
which arises from the electromagnetic interaction between the neutron and magnon helds: 


The quantity Sij{uj,q) is the spin correlation function, which contains all of the 
information about the scattering medium which is relevant for analyzing the neutron 
collision. It is dehned by: 


S'ij(a;,q) = / dtdh (si(r, t) Sj(0)) e 


iut-ici-r 


(3.1.14) 


The quantity (sj(r, t) 5^(0)) = Tr[p Sj(r, t) Sj(0)] appearing in here is the expectation 
dehned by the density matrix, p, which characterizes the initial state of the medium with 
which the neutron scatters. Notice that it is not a time-ordered product of operators 
which appears in this expectation. 

This correlation function may be explicitly computed at low energies using the 
previously-derived effective lagrangian which describes the low-energy magnon self-interactions. 
The simplest case is that for which the medium is initially in the no-magnon ground state, 
and where the energies involved are low enough to neglect the magnon self-interactions. 


In this case the spin density may be well-approximated by the hrst terms of eq. (|3.1.6|). 
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Then: 


o / N e e/ I IN 

Oij d[uj - n|q|), 


(3.1.15) 


where v = Fg/Ft is the speed of magnon propagation. 

We see that the cross section has sharp peaks when the nentron energy and mo¬ 
mentum transfers are related by the magnon dispersion relation: E — E' = n|p — p^|. 
This corresponds to inelastic scattering in which the neutron transfers its energy and 
momentum to the medium by creating a magnon. According to eq. (|3.1.15|) the resulting 
peaks in the cross section are inhnitely sharp, but in real systems they have a hnite width 
due to processes which cause the produced magnons to scatter or decay. If the lifetime 
for undisturbed magnon propagation, r = l/T, is much longer than the other interaction 
times of interest in the neutron scattering, then the delta function in eq. (p.1.15 ) becomes 
replaced by the lineshape: 


(5(o;-n|q|) 


(3.1.16) 


2 ti {uj - n|q|)2 -h | P' 

Measurements of the positions and widths of these peaks as functions of the scattered 
neutron energy and momentum can be used to measure the magnon dispersion relation — 
and so the constant v — and its decay rate, T, for the scattering medium. The predicted 
linear spectrum is indeed found when neutrons are scattered from antiferromagnets. The 
situation when neutrons scatter from ferromagnets is different, as we shall now see. 


3.2 Ferromagnetism: T Breaking 

For ferromagnets, the order parameter is simply the total magnetization, or the total 
spin, of the system. Since this dehnes a vector in space, the spin symmetry, G = S'0(3), 
is spontaneously broken to H = SO{2) just as for an antiferromagnet. The low energy 
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behaviour of ferromagnets and antiferromagnets are nevertheless quite different, and this 
difference is due to the fact that time-reversal symmetry is broken in a ferromagnet but 
not in an antiferromagnet. 

We denote by S the order parameter for a ferromagnet, which is related to the spins. 
Si, of the underlying spin model by: 

S = ^s,. (3.2.1) 

i 

Ferromagnets are characterized by having ground states for which there is a nonzero 
expectation for this quantity: (S) 7 ^ 0. 

The action of time-reversal invariance, T, is to reverse the sign of the spin of every 
site. Si —>■ —Si, and so it does the same for the order parameter, S —> — S. The difference 
with the antiferromagnetic case arises because for a ferromagnet it is not possible to 
hnd another broken symmetry which combines with time reversal to preserve S. The 
low-energy effective theory can therefore contain T-violating terms, and this changes the 
properties of its Goldstone bosons in an important way. 

3.2.1 The Nonlinear Realization 

Since the symmetry-breaking pattern for both ferromagnets and antiferromagnets is SO{3) 
SO{2), the nonlinear realization of this symmetry on the Goldstone bosons is identical 
for these two systems. We therefore use the same polar coordinates in this case, 6 and 0, 
as in the previous sections. As before it is convenient to use these to dehne a unit vector, 
denoted by s, with components = sin 6 * cos 0, Sy = sin 6 ^ sin 0 and = cos 9, so that 
s-s = 1. The held, s{r,t), again describes long-wavelength oscillations in the direction of 
(S). 
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The action of SO{3) on these variables is once more given by eq. (|3.1.5|) , and the 
term in the effective lagrangian which involves the fewest spatial derivatives is again 
determined to be: 

= - ^(ve-ve + sin^ e ■ v^). ( 3 . 2 . 2 ) 


The new features appear once the term with the fewest time derivatives is con¬ 
structed. As is discussed in some detail in Chapter 1, this involves only a single time 
derivative because of the broken time-reversal symmetry. It has the form given by 


eq. (| 03 ): 


= -A^{e) r, (3.2.3) 

where the coefficient function, Ao,{0), may be considered to be a gauge field defined on 
the coset space G/H. In Chapter 1 it was determined that the condition that this term 
be G invariant is that A^ must only be G-invariant up to a gauge transformation, in the 
sense that: 




(3.2.4) 


for each generator 50“ = of G on G/H, where 12^(0) are a collection of scalar functions 
on G/H. This last condition is equivalent to the invariance of the field strength for Aa'. 
£^Fai 3 = 0. Our problem is to explicitly construct such a gauge potential for the example 
of interest, G/H = SO{3)/SO{2) = 5*2. 

This construction is quite simple. Since our coset space is two dimensional, it is 
always possible to write the field strength in terms of a scalar field: FaP — 
where eap is the antisymmetric tensor which is constructed using the coset’s G-invariant 
metric. The condition that Fal3 be G invariant is then equivalent to the invariance of B. 
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That is: 


£^B = C daB = 0, (3.2.5) 

which is only possible for all G transformations if is a constant, independent of 0". 

Our solution for Cp^t for a ferromagnet therefore simply boils down to the construc¬ 
tion of a gauge potential for which = B on the two-sphere, S 2 = SO{3)/SO{2). 
But such a gauge potential is very familiar — it is the gauge potential for a magnetic 
monopole positioned at the centre of the two-sphere. The result may therefore be written 
(locally) as: d6°‘ = B cos 6 dcj), and so the corresponding lagrangian is given by 

Cp^t = — B cos 9 0, (3.2.6) 


where is a constant. In terms of the vectors s, ee = ds/dO and = ds/dcj) this may be 
written: 


Cp^t = - B s- (ee X e^). 


(3.2.7) 


The complete Goldstone boson lagrangian containing the fewest time and space 


derivatives is found by combining the contributions of eqs. ( p.2.2|) and (|3.2.6| ), giving: 


Cp = — B cos 6 6 — 


(ve ■ ve + sin^ 0 V0 ■ V 


(3.2.8) 


It is instructive to compute the Noether currents for the 50(3) symmetry that is 
implied by this lagrangian density. The conserved current density is the same as was 
found for the antiferromagnet: 


j = (s X Vs) + ■ ■ ■. 


(3.2.9) 


In computing the corresponding expression for the charge density, it is necessary to keep 
in mind that under these transformations Cp is not invariant, but instead transforms into 
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a total derivative: 




B 


(ujx COS (j) + uiy sin (fj. 


(3.2.10) 


dt sin 9 

Using this in the general expression, eq. (|1.2.3|) , for the Noether current gives the conserved 
charge density: 

p = Bs + ---. (3.2.11) 


The ellipses in this equation, and in eq. ( p.2.9|) , represent more complicated terms which 
are suppressed by additional derivatives. As is easily verihed, the classical equations of 
motion for the lagrangian, (|3.2.8|) , are equivalent to the conservation condition for this 
current: 

(s X V^s) = 0. (3.2.12) 


This equation has long been known to describe long-wavelength spin waves in ferromag- 
nets, and is called the Landau-Lifshitz equation. The constant, k, here is given in terms 
of Fs and B by 

k = -^. (3.2.13) 

Equation (|3.2.11|) brings out a feature of time-reversal breaking systems which is 
qualitatively different from those which preserve time reversal. It states that it is the 
conserved charge density itself, p, which acquires a vacuum expectation value and breaks 
the SO{3) symmetry: 

{^ = B {s) = B So ^0. (3.2.14) 


Clearly the breaking of time reversal (and lorentz invariance) are prerequisites for the 
acquisition of a nonzero ground-state expectation value for p, which is the time component 
of a current. 
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3.2.2 Physical Applications 


The propagation of small-amplitude, long-wavelength spin waves is therefore seen to be 
completely determined by the underlying pattern of spontaneous symmetry breaking: 
SO{3) —>■ 50(2) together with T violation. Linearizing the Landau Lifshitz equation, 


eq. ( p.2.12|) , shows the resulting propagating modes to have the quadratic dispersion 
relation: 

E{p) = kp^. (3.2.15) 


This dispersion relation, and the value of the constant k, can be measured by neutron 
scattering, in a manner that is similar to what was found for antiferromagnets. We 
highlight here only the differences which arise from the antiferromagnetic example. 

The lowest-dimension effective interaction which couples the held s to electromag¬ 
netic helds in the ferromagnetic case is: 


/lem /i S ■ B, 

= —pB [b^ sin 9 cos (j) By sin 0 sin 0 -|- B^ cos 6 ^, 

= —pB Bx — pB [By 5(f) — Bz 56^ + • • ■ , (3.2.16) 

where p is an effective coupling parameter and we have taken the expectation value, (S) 
to point in the positive x direction, so 6 = ^ + 66 and (f> = 5(f>. The constant term, 
independent of 69 and 50, in the hnal line of eqs. (|3.2.16|) gives the interaction energy 
between the magnetic held and the expectation value, (S). This permits the physical 
interpretation of the constant p as the magnetic-moment density of the material. 

This interaction between s and B breaks T invariance, and has the following puzzling 
feature. It does not involve any derivatives of the Goldstone boson helds, 9 and 0, in 

apparent contradiction with the general results of Chapter 1. In fact, the absence of 
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derivatives in eq. (p.2.16 ) is very much like the absence of derivatives in the pion mass 
term. This is because the coupling between s and B relates the internal spin SO{3) 
symmetry to ordinary rotations in space, and so destroys the freedom to consider both 
as separate symmetries. But because rotation invariance is a spacetime symmetry, the 
derivations of Chapter 1 do not directly apply, since these assumed the action of internal 
symmetries from the outset in the transformation rules of the helds. 

Once more taking the neutron coupling to the magnetic held as in eq. ( p.l.ll|) , 
we may compute the cross section for inelastic neutron scattering. For slowly-moving 
neutrons, and under the assumption that only the momentum, p', of the scattered neutron 
is measured we hnd: 


da 






^p(p - P ) Sij{E -E',p- p) 


(3.2.17) 


The variables are the same as for the antiferromagnetic example: and are the speed 

and magnetic moment of the slow incoming neutron, V is the volume of the medium, E 
{E') is the energy of the initial (hnal) neutron, and p (p') are the corresponding momenta. 
The magnetic-moment interaction, Vij{q), is as given in eq. (|3.1.13|) . 

The medium-dependent quantity, Sij{uj, q), once more represents the spin correlation 
function, dehned by eq. (p.l.l4|) . As was the case for the ferromagnet this is dominated 
by sharp peaks when the neutron scatters to produce a magnon, and so has an energy 
and momentum transfer related by the magnon dispersion relation. For a ferromagnet 
this is: E — E' = k{p — p')^. Measurements of these peaks as functions of the scattered 
neutron energy and momentum indeed verihes the quadratic dispersion relation, and can 
be used to measure the constant k. 

A second consequence of the quadratic magnon dispersion relation is the temperature 
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dependence of the magnetization, M = |M|, of a sample at very low temperatures. 
Since the magnon held describes the long-wavelength deviations of the net magnetization 
from its ground state value, the net magnetization at very low temperatures is simply 
proportional to the average magnon occupation number. That is: 

M(0) - M(T) oc M(0) Jd^pn(^^Y (3.2.18) 

where n{E/T) = {exp[E/T] — 1)“^ is the Bose-Einstein distribution. The temperature 
dependence of this result can be determined by changing integration variables from p to 
the dimensionless quantity x = E/T. If E{p) oc p^, for some power, ; 2 , then; 

dp = p^%dEoc dE oc (3.2.19) 

dE 

For z = 2 this predicts [M(0) — M(T)]/M(0) oc in agreement with low-temperature 
observations. 


98 



Chapter 4 

<S'0(5)-Invariance and 
Superconductors 


The techniques described herein have recently proven useful to analyze the consequences 
of a remarkable proposal for the existence of an SO {5) invariance amongst the cuprates 
which exhibit high-temperature superconductivity. This chapter presents a bare-bones 
outline of this proposal, together with a brief summary of the Goldstone-boson properties 
which emerge. 

4.1 SO{b) Symmetry 

Although a proper presentation of the arguments for — and against, since the subject 
remains controversial — the SO{5) proposal is beyond the scope of this review, the form 
of the proposed symmetry itself is easy to state. The starting point is the following 
experimental fact: by performing small adjustments to any of the high-Tc cuprates, it is 
possible to convert them from superconductors into antiferromagnets. This adjustment 
is typically accomplished in practice by altering the ‘doping’, which means that atoms 
having different valences are randomly substituted into a portion of the unit cells of the 
material of interest For example the element Sr might be substituted for the element La in 
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Doping 

Figure 4.1: A typical Temperature-vs-Doping phase diagram for a high-T^ system. 

some fraction, x, of the unit cells. Physically, this substitution has the effect of changing 
the number of charge carriers in the band from which the superconducting electrons are 
taken. 

This basic association of superconductivity and antiferromagnetism suggests a fun¬ 
damental connection between the two. Zhang’s proposal is that — at least over part of 
the theory’s parameter space — these two phases are related by an approximate SO{5) 
symmetry. The action of the symmetry is simplest to state for the order parameters for 
the two phases. 

As we have seen, in §3, the order parameter for the antiferromagnetic (AF) phase is 
simply the direction in space, n, into which the alternating aligned spins point. A nonzero 
value of this order parameter spontaneously breaks the SO {3) ~ SU{2) symmetry of 
rotations amongst electron spins: 

/ ni \ / ni \ 

n2 C>3 ^^2 , (4-1.1) 

{nsj {nsj 

where On denotes an n-by-n real, orthogonal matrix. 

For the superconducting (SC) phase, on the other hand, the order parameter, -ip, is a 
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quantity which carries the quantum numbers of a pair of electrons (or holes). Specifically, 
it has electric charge q = ±2e, if e is the proton charge, and is usually taken to have no 
spin. (The sign of this charge depends on whether the charge carriers are electrons or 
holes.) A nonzero value for ifj signals the breaking of the symmetry of electromagnetic 
phase rotations: V’ ^|J. This symmetry forms the group SO{2) ~ U{1), as is easily 

seen by grouping the real and imaginary parts of "0 into a two-component vector: 


V 0/ 


02 



(4.1.2) 


where 'll: = ij^n + i 'll:!. 

The SO{5) proposal is that the system is approximately invariant under five-by-five 
rotations of all five components of the order parameter. 


/0fl\ 


/0fl\ 

V’/ 



Ui 


ni 

^2 


n2 

\^3/ 


Vng/ 


This symmetry contains electron-spin rotations, eq. (EOl). and electromagnetic phase 
transformations, eq. (|4.1.2|) , as the block-diagonal 50(2) x 50(3) subgroup which acts 
separately on the and the Ua- 

As of this writing, there is a controversy over whether such an approximate symmetry 
actually exists for high-Tc superconductors and, if so, to how large a portion of the phase 
diagram it might apply. Regardless of how this controversy ultimately becomes resolved, 
two points on which everyone must agree are: 


1. For any part of the phase diagram for which approximate 50(5) symmetry holds, 
and which lies within the ordered (AF or SC) phases, there must be a total of four 
Goldstone (or pseudo-Goldstone) bosons. This corresponds to one each for the four 
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S0{5) generators which are broken by a nonzero value for one of the tjjk or Ua- These 
bosons include the usual Goldstone bosons for S'0(3) or SO{2) invariance {e.g., the 
magnons), plus some new pseudo-Goldstone bosons which are consequences only of 
the assumed SO {5) symmetry. 

2. The low-energy properties of these Goldstone and pseudo-Goldstone bosons are com¬ 
pletely dictated by the assumed symmetry-breaking pattern, and are independent 
of the details of whatever microscopic electron dynamics gives rise to the symmetry 
in the first place. These low-energy properties may be efficiently described using an 
effective lagrangian along the lines of those described in Ghapter 1. 

These two properties taken together make an unambiguous detection of the S'0(5) pseudo- 
Goldstone bosons a particularly attractive test of the SO{5) proposal. Their detection 
would be a ‘smoking gun’ for the existence of an extended symmetry like SO{5). Better 
yet, their properties are unambiguously predicted theoretically, without the the usual 
complications which arise when complicated electron dynamics is squeezed into a simple 
theoretical model. We now construct the low-energy effective lagrangian describing these 
pseudo-Goldstone bosons, following the general techniques of the previous sections. 

4.2 The Effective Lagrangian in the Symmetry Limit 

We start with the effective lagrangian in the (idealized) limit where SO{5) is not just 
approximate, but is instead a bona fide symmetry of the system. In this case the lagrangian 
symmetry is G = SO {5), and this is spontaneously broken (by the order parameters we 
are considering) to the subgroup, H = SO {4). We require in this limit the nonlinear sigma 
model for the quotient space G/H = S'0(5)/S'0(4). As discussed in general in §1, the 
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lowest terms in the derivative expansion of the Lagrangian for this system are therefore 
completely determined up to a small number of constants. (Precisely how many constants 
depends on how much symmetry - including crystallographic symmetries - the system 
has. As we describe shortly, more possibilities also arise once explicit S'0(5)-breaking 
interactions are introduced.) 

In this particular case the space S'0(5)/S'0(4) is an old friend: the four-sphere, 54 , 
which is dehned as the points swept out by an arbitrary hve-dimensional vector, N, which 
has unit length: N ■ N = Nf = 1. In terms of such a held, the invariant lagrangian 
obtained using the techniques of previous sections is 

(4.2.1) 


r -k 

mv 2 


2 f2 

dtN -dtN-^-^VN- VN 


(For simplicity of this and later expressions, eq. (|4.2.1|) assumes rotational invariance, 
which is not appropriate for real cuprates. For real systems, the electrons believed re¬ 
sponsible for superconductivity and antiferromagnetism move preferentially along planes 
made up of copper and oxygen atoms. The low-energy lagrangian for such systems is bet¬ 
ter written either in two space dimensions (for Goldstone bosons conhned to the planes), 
or in three dimensions with separate coefficients, fa for each spatial direction, Vq. These 
complications are ignored here, but are discussed in more detail in the original references. 
As is also discussed in these referencse, in specihc dimensions it is sometimes also possible 
to write more invariants than are considered here, such as those which depend on the 
completely antisymmetric tensor, e^-fc..-) 

A convenient parameterization for the four-sphere, and so of our Goldstone bosons, 
is given by polar coordinates: 

' cos (j)' 


N = 


Ur 


Us 


where 


Uq = cos 9 


sm I 


/ sinacos/?' 


n, = sin 6* 


sin a sin/3 | . (4.2.2) 
\ cos a 
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(As usual, care is required to properly handle those points where these coordinates are 
singular). 

Using the standard expression for the round metric on 5*4 then gives the unique 
Goldstone-boson effective lagrangian for a rotational- and time-reversal-invariant system 
in the SO{5) symmetry limit: 


p _ J t 

2 ^ 


{dtOY + cos^ 6{dt4>)‘^ + sin^ 6(^{dta)'^ + sin^ a (d,Pr) 


11 

2 L 


(V6*)^ -I- cos^ 6*(V0)^ -I- sin^ 6(^{'Va)‘^ + sin^ q;(V/5)' 


4.3 Symmetry-Breaking Terms 


Next consider how small S'O (5)-breaking effects can change the low-energy lagrangian. 
We may do so by following the same steps as were taken in §2 to describe the implications 
of quark masses on low-energy pion properties. We do so here in two steps. We hrst 
classify the types of violation of SO{5) symmetry which can arise in real systems. We 
then examine which consequences follow only from S'O(2) x S'O (3) invariance, in order 
to be able to disentangle these from predictions which are specihc to S'0(5). Finally we 
perturb in the various S'O(5) symmetry-breaking parameters to obtain the predictions of 
approximate S'O(5) invariance. By contrasting what is obtained with the result assuming 
only S'O(2) x S'O(3) invariance, the consequences of approximate S'O(5) invariance may 
be found. 

4.3.1 Kinds of Explicit Symmetry Breaking (Qualitative) 

There are several kinds of S'O(5) symmetry breaking which are worth distinguishing from 
one another. These are: 


104 



1. Electromagnetic Interactions: One source of explicit S'0(5) symmetry breaking is 
any couplings to applied macroscopic electromagnetic fields. Electromagnetic gauge 
invariance requires these to be incorporated into the lagrangian using the usual 
precedure of minimal substitution; 

dtip ^ {dt — i q Aq)iI)^ Vip ^ {V — i q A)il). (4.3.1) 

Such couplings necessarily break S'0(5) because they treat the electrically-charged 
components of N differently than the electrically-neutral ones. Although such cou¬ 
plings do not pose any complications of principle, for simplicity’s sake we imagine 
no macroscopic electromagnetic helds to be applied in what follows. 

2. Doping: One of the physical variables on which the phase diagram of the cuprates 
crucially depends is the doping. Since changes in the doping correspond to changes 
in the density of charge carriers amongst the electrons which are relevant for both 
the antiferromagnetism and the superconductivity, it may be described within the 
effective theory by using a chemical potential, p, coupled to electric charge. In 
this way adjustments in p may be chosen to ensure that the system has any given 
experimental charge density. Mathematically a chemical potential is introduced by 
replacing the system Hamiltonian, H, with the quantity H — fiQ, where Q is the 
electric charge. Within the Lagrangian formulation in which we are working, this 
amounts to simply making the replacement Aq ^ Aq + piin the electrostatic scalar 
potential, Aq. 

3. Intrinsic Breaking: The third, and final, category of symmetry-breaking consists of 
everything apart from the previous two. It is known that SO {5) is not an exact 
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symmetry, even with no chemical potential, and in the absence of any applied elec¬ 
tromagnetic helds. It is the interactions (involving the fewest derivatives) in this 
last class of symmetry-breaking terms which we now wish to classify. 


4.3.2 General SO{2) x 5'0(3)-Invariant Interactions 


If 50(5) were not a symmetry at all, then there would be no guarantee that the low- 
energy spectrum should contain particles described by all of the helds a, P, 6 and p. For 
the purposes of later comparison, it is nevertheless useful to ask what kinds of low-energy 
interactions among such states are permitted by 50(2) x 50(3) invariance. 

The most general such Lagrangian involving these four states may be written in 
terms of the helds Ug and Uq, where these helds satisfy the constraint ng-ns + nQ-nQ = 1, 
to the extent that we are interested in only those modes which would be Goldstone or 
pseudo-Goldstone modes in the 50(5) limit. The most general such result, which involves 
at most two derivatives, supplements the invariant expression, eqs. (|4.2.1|) or ([4.2.3| ), with 
the following terms: 


^sh = -G A dtUg ■ dtUQ + B dtUg ■ dtUg + C (uq ■ dtUgf 

-f'^lDVariQ ■ VaUQ + EVaUs ' Varig + F{nQ ■ Vangf 


where ft and fg are the constants appearing in the invariant lagrangian. The quantities 
V, A, B,C, D, E and E are potentially arbitrary functions of the unique 50(2) x 50(3) 
invariant which involves no derivatives: Uq -riQ. (Recall Ug ■ Ug is not independent due to 
the constraint Ug ■ ng + Uq ■ Uq = 1). 

In terms of polar coordinates, and inserting a chemical potential as just described. 


106 








the total effective lagrangian becomes: 


Pv 

c = -1^ + Y 


(l + 2A sin^ 0 + 2B cos^ 9 + 20 sin^ 9 coP 9) (5*0)^ 


+ (1 + 2A) cos^ 9{dt(t) + q+Y + (1 + 2,B) sin^ + sin^ a{dt(5Y'^ 

f? 


f S 

T L 


’(l + 2D sin^ 9+ 2E cos^ 9 + 2F sin^ 9 cos^ 9) {V9Y 
+ (1 + 2D) cos^ 9{V(j))^ + (1 + 2D) sin^ ^((Va)^ + sin^ a(V/3)^) 


+ 


where all coefficient functions, V, A,...etc., are now to be regarded as functions of cos^6'. 

4.3.3 Kinds of Explicit Symmetry Breaking (Quantitative) 


Eq. (I4.3.2D does not yet use any information concerning the nature or size of the explicit 
symmetry breaking (apart from the inclusion of /i). This we must now do if we are 
to quantify the predictions of approximate 50(5) invariance. We do so by making an 
assumption as to how the symmetry-breaking terms transform under 50(5). 

In §2 we saw, for pions, that the quark masses were responsible for explicitly breaking 
the would-be chiral symmetry of the underlying microscopic theory [QCD). Although 
the same reasoning can be applied to 50(5) breaking due to electromagnetic interactions 
and chemical potential dependence, incomplete understanding of the dynamics of the 
microscopic theory so far precludes a similar identification of the other symmetry-breaking 
parameters within some underlying condensed-matter system. For these, we instead are 
forced to make an assumption. 

We therefore assume all 50(5)-breaking terms of the effective lagrangian to be 
proportional to one of two possible quantities: 

1. Chemical Potential: Since we know how the chemical potential appears in the la¬ 
grangian, we know in detail how it breaks 50(5). It does so by an amount which is 
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proportional to the electric charge. For the helds appearing in N, this is represented 
by the £ve-by-£ve electric charge matrix: Q = diag(g, q, 0, 0, 0). 

2. Intrinsic Symmetry Breaking: In the absence of more information, we make the 
simplest assumption for the form taken in the e£ective lagrangian by all other 
microscopic e£ects which explicitly break S'0(5). Since these break S'0(5) to 
S'0(3) X SO{2) we take them to be proportional to a £ve-by-£ve matrix, M, where 
M = e diag (3, 3, —2, —2, —2). Here e 1 is a measure of the quality of the approx¬ 
imation that SO {5) is a symmetry. 

With these choices the Lagrangian is then the most general function of the helds 
N = I^Q and M, subject to the following S'0(5) transformation property 

C{0,N, O.iaQOl, 0,M0l) = C{N, pQ, M), (4.3.2) 

where is an S'0(5) transformation. The implications of the approximate S'0(5) in¬ 
variance may then be extracted by expanding C in powers of the small quantities e and 
fi. Since M and Q always appear premultiplied by these small numbers, this expansion 
restricts the kinds of symmetry breaking which can arise order by order, which in turn 
constrains the possible 6*-dependence of the coefficient functions in C. 

For example, a term in the scalar potential involving 2n powers of N must have the 
following form: 

Ckih,...,k„,ln 

(4.3.3) 

Only even powers of Q enter here due to its antisymmetry, and the term ki = li = 0 is 
excluded from the sums due to the constraint N'^N = 1. Clearly, expanding C to low 


N ■■■ N ■ [eMf-N 
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order in the S'(9(5)-breaking parameters e and /i necessarily also implies keeping only the 
lowest powers of Uq ■ Hq = cos^ 9 in V. 

Similar conclusions may be obtained for the other coefficient functions in the La- 
grangian of eq. ( |4.3.2| ). Working to 0(e^, e/i^, /i^) in V, and to 0(e, /i^) in the two-derivative 
terms then gives: 

^ ■ (4.3.4) 


1 / = l/g + 1/2 cos^ 9 + -V4 cos^ 9 , 


and 


A = Ao + A2 cos^ 9, B = Bq + A2 cos^ 9, C = Co, 
D = Dq + D 2 cos^ 9, E = Eq + D 2 cos^ 9, E = Eq, 


for the coefficient functions in eq. (|4.3.2|) . Notice that the terms proportional to cos^ 6 ^ 
in A and B are identical, as are the corresponding terms in D and E. Expanding in 
powers of e and p, the constants in eqs. ( |4.3.4| ) and ( [f.3.5|) start off linear in e and 
Ai = Afe -|- A^^fE + ■ ■ ■ etc.. The only exceptions to this statement are: Bq, Eq oc e (no 
/i^ term), Cq, Eq oc /i^ (no e term), and V 4 = Furthermore, since 

the jA Uq ■ Uq term in V arises from substituting dt ^ dt — ifiQ in the kinetic term for 


Hq, we have: to leading order. Higher powers of fi originate from terms i 


m 


C which involve more than two derivatives. 


4.4 Pseudo-Goldstone Dispersion Relations 


We now turn to the calculation of the pseudo-Goldstone boson dispersion relations. The 
scalar potential of eq. ([4.3.2|) has three types of extrema: 


( 1 ) 6^0 = 0 or tt; 
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Stt 

Y 


( 4 . 4 . 1 ) 


(2) 9o = ^ or 

(3) 6*0 where c = cos 6 *o satisfies V'{c^) = 0. 


This leads to the four possible phases: (i) SC phase: extremum (1) is a minimum, and 
(2) is a maximum; (ii) AF phase: (2) is a minimum, and (1) is a maximum; {in) MX 
phase: both (1) and (2) are maxima, and (3) is a minimum; or {iv) metastable phase: 
both (1) and (2) are minima, and (3) is a maximum. We focus here purely on the AF 
and SC phases. 


4.4.1 Superconducting Phase 


An expansion about the superconducting mimimum, 6q = 0, gives the dispersion relations 
in this phase for the four bosons. Three of these — 6 *, a and f3 — form a spin triplet of 
pseudo-Goldstone modes for which 


m 


+ £ 


2 


1 

2 


(4.4.2) 


with the phase speed, c^(S'C), and gap, given to lowest order in S'0(5)-breaking 
parameters by: 


= 


1 + 2 


1 + 2(^Eq — + 2(^D2 — A2^ 


£^ = 
'-'SC 


11 
fi 
fl 
f? 

-2V'{1) 

-2{V2 + C 4 ) 

f! ■ 


(B(l)-B(l)) 


(4.4.3) 


In both of these results the first equation uses the general effective theory, eq. ([4.3.2|) , while 
the second equality incorporates the additional information of eqs. (^4.3.4|) and (|4.3.5|). 
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An important part of the SO{5) proposal is that these states have been seen in neutron¬ 
scattering experiments in the superconducting phase of the high-T^ cuprates, even quite 
far away from the antiferromagnetic regime. 

The remaining held, 0, would have been a bona fide gapless Goldstone mode in the 
absence of electromagnetic interactions. Its dispersion relation, E{k) is a more compli¬ 
cated function of c^/c^ and eqfi, whose form is not required here. The quantity c which 
appears with k throughout its dispersion relation is given explicitly by 


cliSC) 


1 + 2{d{1)-A{1)) 

1 -|- 2(^Dq — -|- 2(^D2 — ^ 2 ^ 


4.4.2 Antiferromagnetic Phase 


(4.4.4) 


Expanding about the AF minimum gives the usual two magnons, as in §3, satisfying 


dispersion relation of eq. (|4.4.2|) with: 


clMF) = 5 

Jt 

= ll 

f? 

ElAAF) = 0 . 


1 + 2(e(0)-S(0)) 


1 -|- 2[Eq — 


(4.4.5) 


The remaining two states group into an electrically-charged pseudo-Goldstone state sat¬ 
isfying: 


E±{k) 




±qfi, 


(4.4.6) 


with: 


^pGB 


f2 

{AE) = ^ 
Jt 


1 + 2 


(B(O)-yl(O)) 
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£L = £laB{AF) = 


ll 

ft 

2 ^( 0 ) 

~ir 

fi 


1 + 2(^Dq — 


2 • 


(4.4.7) 


These expressions imply a simple dependence of the gap on the chemical potential: 


Sap = m 


Kjf, 


^sc = 


(4.4.8) 


where := 2Vfe/f+0{ef , k := -2Vf/f+0{e) = q‘^+0{e) and^ := 2V^^/f+0{e). 
Within the AF phase the psendo-Goldstone boson gap is predicted to fall linearly with 




^AF 




(4.4.9) 


where /i^F represents the doping for which one leaves the AF regime. Similarly varies 
qnadratically with fF. 

Robust consequences of S'0(5) invariance are obtained from expressions such as 
these by eliminating the free parameters to obtain relations amongst observables. For 
example, if one eliminates parameters in favour of properties of the gap as a function of 
/i we hnd: 


^ sff) 

4f(o) 

2 

f^AF 


.(o)r 


l-^AF 


2 _ 2 
h'AF h" 


4c(oPt) 2 


/^opt 

2^ 


(/X /^5C-)(^/^opt )? 


opt) 


/^opt 


ifc- + 0{ef. 


(4.4.10) 


Here Popt denotes the chemical potential corresponding to the maximum gap, Ssc 
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Similarly, the phase velocities for all modes in both SC and AF phases are equal to 
one another, and to ft/fg, in the limit of strict SO{5) invariance. (The parameters ft 
and fs may be related to other observables, such as the electric and magnetic screening 
lengths.) It turns out that the 0(e) corrections to this limit are not arbitrary, but also 
satisfy some model-independent relations, which follow by eliminating parameters from 
the above expressions: 

4(30) - cl(AF) = cliSC) - cl(AF) = 0(e). (4.4.11) 

4.5 Summary 

Approximate AO(5) invariance clearly carries real implications for the low-energy exci¬ 
tations of the system. It predicts, in particular, the existence of a spin-triplet pseudo- 
Goldstone state in the SC phase, and an electrically-charged state in the AF phase. 
Furthermore, AO(5) invariance unambiguously relates the properties of these states, like 
their gap and phase velocity, to one another. Better yet, these properties are claimed to 
have been measured in the SC phase, since the spin-triplet state is believed to have been 
observed, with a gap (at optimal doping) of 41 meV. If true, this permits the inference of 
the size of the rough order of the AO(5)-breaking parameter e, and hence to predictions 
for the properties of the hitherto undetected boson in the AF phase. 

It is extremely unlikely that an electrically-charged state having a gap of only ~ 40 
meV can exist deep within the AF phase. Among other things it would make its presence 
felt through the electromagnetic response of these systems in the AF phase. At the very 
least, one can therefore conclude that AO(5) invariance cannot penetrate very far into the 
AF part of the phase diagram, despite its appearance fairly deep in the SC phase. Being 
based purely on pseudo-Goldstone boson properties, this conclusion comes independent 
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of the details of how the underlying electrons are interacting on more microscopic scales. 

There is not yet a consensus as to how uncomfortable this conclusion should make 
one feel about the remarkable SO{5) hypothesis. Either way, robust predictions based on 
the low-energy consequences of symmetries are likely to play a key role in forming any 
such hnal consensus. 
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This review is my personal presentation of the theory of Goldstone bosons, and (apart 
from the SO(5) section) were hrst given as part of a lecture course for the Swiss Cours de 
Troisieme Cycle, in Lausanne and Neuchatel in June 1994. I am indebted to the University 
of Neuchatel, the University of Oslo, Seoul National University, and my colleagues at 
McGill University for providing the forum in which to work out the presentation of these 
ideas. I thank Patrick Labelle and Oscar Hernandez for constructively criticizing them as 
they have evolved. Needless to say, any errors that remain are mine alone. 

The classic early papers which founded this subject are 

1. S. Weinberg, Physical Review Letters 18 (1967) 188; Physical Review 166 (1968) 

1568. 

2. G.G. Gallan, S. Goleman, J. Wess and B. Zumino, Physical Review 177 (1969) 2247. 

My thinking has been strongly influenced by graduate courses given on this subject 
during the early 1980’s in the University of Texas at Austin, by Steven Weinberg. This 
review is, in its presentation, very similar with the logic of his textbook: The Quantum 
Theory of Fields, vols. I and II. 


115 



The following papers and reviews have also shaped how I nnderstand this snbject: 


1. S. Weinberg, Phenomenological Lagrangians, Physica 96A (1979) 327-340. 

2. S. Weinberg, What is Quantum Field Theory, and What Did We Think it is?, 
Talk given at Conference on Historical Examination and Philosophical Reflections 
on the Fonndations of Quantum Field Theory, Boston, MA, 1-3 Mar 1996; ([hep- 
th/970202~7D . 

3. The Quantum Theory of Fields, Volumes I, II: Modern Applications, Steven Wein¬ 
berg, Cambridge University Press, 1995 and 1996. 

4. J. Polchinski, Effective Theory of the Fermi Surface, in the proceedings of the 1992 
Theoretical Advanced Study Institute, Boulder, Colorado, ([hep-th/9210046|) . 

5. W.E. Caswell and G.P. Lepage, Physics Letters B167 (1986) 437; T. Kinoshita and 
G.P. Lepage, in Quantum Electrodynamics, ed. by T. Kinoshita, (World Scientihc, 
Singapore, 1990), pp. 81-89. 

6 . J. Gasser and H. Leutwyler, Chiral Perturbation Theory to One Loop, Annals of 
Physics (NY) 158 (1984) 142. 

7. N. Isgur and M.B. Wise, Weak Decays of Heavy Mesons in the Static Quark Ap¬ 
proximation, Physics Letters B232 (1989) 113. 

8 . S. Weinberg, Nuclear Forces from Chiral Lagrangians, Physics Letters B251 (1990) 
288-292. 


116 
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5.2 Particle Physics Data 

I have taken the experimental values for the section on pions and nucleons from The 
Review of Particle Properties, The European Physical Journal C3, 1 (1998). 

5.3 SO{5) Invariance in High-Tc Superconductors 

It is not the intention of these notes to provide more than a superhcial discussion of the 
SO{5) proposal for the high-T^ cuprates, which was hrst made by Zhang, in 

1 . S.C. Zhang, Science 275 (1997) 1089. 


This remarkable proposal immediately stimulated much interest, with an associated lit¬ 
erature. I do not attempt a literature survey here, apart from listing three highlights in 
which the ‘big picture’ is addressed by several of the held’s major players. These are: 

1. G. Baskaran, P.W. Anderson, On an SO {5) Unification Attempt for the Cuprates, 
c^d-mat/9706076 . 


2. R.B. Laughlin, A Critique of Two Metals, |cond-mat/9709195 


3. P.W. Anderson, G. Baskaran, A Critique of ‘A Critique of Two Metals’, cond- 

mat/97irm . 


The applications of the technique of nonlinear realizations to identify the model- 
independent properties of the resulting Goldstone and pseudo-Goldstone bosons, which 
is followed in Chapter 4, is performed in: 
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2. C.P. Burgess, J.M. Cline, C.A. Liitken, |cond-mat/9801303 


5.4 Condensed Matter Physics for Particle and Nu¬ 
clear Physicists 

There are several sources for condensed matter physics which I have found useful. Some 
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of Modern Physics, ( |cond-mat/930700^ . 


2. Principles of Condensed Matter Physics, P.M. Chaikin and T.C. Lubensky, Cam¬ 
bridge University Press, 1995. 
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